Method and apparatus for performing partial unscan and near full scan within design for test applications

ABSTRACT

A computer implemented process and system for effectively determining a set of sequential cells with a integrated circuit design that can be scan replaced (e.g. for design for test applications) to offer significant testability while still maintaining specified optimization (e.g., area and/or timing) constraints that are applicable to the design. The novel system selects sequential cells for scan replacement that offer best testability contribution while not selecting sequential cells for scan replacement that do not offer much testability contribution and/or are part of most critical paths within the design. The novel system is composed of a subtractive method and an additive method. The subtractive method inputs a fully scan replaced netlist (e.g., the sequential cells are scan replaced) that does not meet determined optimization constraints. The novel subtractive system unscans selected cells until the area and/or timing constraints are met. A flag indicates whether nor not timing is considered. Selection for unscanning is based on a testability cell list (TCL) that ranks cells by their degree of testability contribution; those cells with low degrees of testability are unscanned first. The additive process receives an unscanned netlist (original design) and scan replaces cells, using the TCL ranked list until optimization (e.g., area and/or timing) constraints of the design are violated. A flag indicates whether nor not timing is considered. The additive system iterates through the TCL list with the cells offering the most contribution for testability scan replaced first.

This is a continuation of copending application Ser. No. 08/649,788filed on May 17, 1996, now U.S. Pat. No. 5,696,771 which is herebyincorporated by reference to this specification which designated theU.S.

BACKGROUND OF THE INVENTION

(1) Field of the Invention

The present invention relates generally to the field of logic synthesisfor integrated circuit devices. More particularly, aspects of thepresent invention relate to methods and apparatus for design for testwithin a logic synthesis system.

(2) Background of the Related Art

Complex integrated circuits are designed with the use of computer aideddesign (CAD) tools. Specifically, application specific integratedcircuits (ASICs) and field programmable gate array (FPGA) circuits canbe designed using a variety of CAD tools. The development of ASICs andFPGA circuits with the aid of CAD tools is referred to as electronicdesign automatic or EDA. Design, checking and testing of large scaleintegrated circuits are so complex that the use of programmed computersystems are required for realization of normal circuits. This is partlybecause the integrated devices are inherently complex and partly becausethe circuit design needs to be decomposed into simpler functions whichare recognized by the CAD tool. It is also partly because considerablecomputation is required in order to achieve an efficient layout of theresultant network. The result of the computerized design process is adetailed specification defining a complex integrated circuit in terms ofa particular technology. This specification can be regarded as atemplate for the fabrication of the physical embodiment of theintegrated circuit using transistors, routing resources, etc.

Integrated circuit designs can be represented in different levels ofabstraction, such as the register transfer level (RTL) and the logicallevel, using a hardware description language (HDL), also called highlevel design language. Two exemplary forms of HDL are Verilog and VHDL.The integrated circuit can be represented by different layers ofabstractions (e.g., behavioral levels, structural levels and gatelevels). An RTL level is an intermediary level of abstraction betweenthe behavioral and structural levels. HDL descriptions can representdesigns of all these levels.

The behavior levels and RTL levels consist general ly of descriptions ofthe circuit expressed with program-like constructs, such as variables,operators conditional loops, procedures and functions. At the logiclevel, the descriptions of the circuit are expressed with Booleanequations. The HDL can be used along with a set of circuit constraintsas an input to a computer implemented compiler (also called a "siliconcompiler"). The computer implemented compiler program processes thisdescription of the integrated circuit and generates therefrom a detailedlist of logic components and the interconnections between thesecomponents. This list is called a "netlist." The components of a netlistcan include primitive cells such as full-adders, NAND gates, NOR gates,XOR gates, latches and D-flip flops, etc. and their interconnectionsused to form a custom design.

In processing the HDL input, the compiler first generates a netlist ofgeneric primitive cells that are technology independent. The compilerthen applies a particular cell library to this generic netlist (thisprocess is called mapping) in order to generate a technology dependentmapped netlist. The mapping process converts the logical representationwhich is independent of technology into a form which is technologydependent. The mapped netlist has recourse to standard circuits, orcells which are available within a cell library forming a part of thedata available to the computer system.

Compiler programs and mapping programs are well known in the art andseveral of these systems are described in U.S. Pat. No. 5,406,497, byAltheimer et al.

An important part of the logic synthesis process involves designing fortestability. Programs that aid in the testability process of logicsynthesis are called design for test (DFT) processes. As part of DFT, itis known to take the mapped netlist generated from a compiler and addand/or replace certain memory cells and associated circuitry withspecial memory cells that are designed to allow the application of testvectors to certain logic portions of the integrated circuit. The act ofapplying test vectors is called stimulation of the design and thespecial memory cells and associated circuitry are referred to as DFTimplementations. Issues concerning controllability deal withfacilitating the application of the test vectors to the circuitry to betested. The same memory cells can be used to capture the output of thecircuitry for observation and compare this output to the expected outputin an effort to determine if circuit (e.g., manufacturing) defects arepresent.

The portions of an integrated circuit that are designed to perform itsintended or expected operational function are called its "mission mode"circuitry while the portions added to the integrated circuit tofacilitate testability are called "test mode" circuitry or DFTimplementations. The resultant circuit therefore has two functionalmodes, mission and test.

An exemplary flow chart diagram of a typical logic synthesis process,including a DFT process, is shown in FIG. 1. The processes 200 describedwith respect to this flow chart is implemented within a computer systemin a CAD environment. High level design language (HDL) descriptions ofthe integrated circuit enter at block 201. Also accompanying the HDL 201is a set of performance constraints 205 applicable to the design whichtypically include timing, area, power consumption, and other performancerelated limitations that the compiler 225 will attempt to satisfy whensynthesizing the integrated circuit design. Constraints 205 can alsoinclude non-performance related constraints such as structural androuting constraints. Compiler 225 consists of a generic compiler 203(also called an HDL compiler, RTL synthesizer, or architecturaloptimizer) that inputs the HDL 201 description and generates therefrom atechnology independent or "generic" netlist 207 which is also dependenton the constraints 205. As discussed above, the netlist 207 is a list oftechnology independent components or operators and the interconnectionsbetween there.

The generic netlist 207 is then input to a design compiler 209 thatincludes a computer implemented logic optimization procedure and amapping procedure which interfaces with a technology dependent celllibrary 230 (e.g., from LSI, VLSI, TI or Xilinx technologies, etc.). Thecell library 230 contains specific information regarding the cells ofthe specific technology selected such as the cell logic, number ofgates, area consumption, power consumption, pin descriptions, etc., foreach cell in the library 230. Logic optimization procedure of block 209includes structuring and flattening procedures. The mapping procedure ofblock 209 generates a gate level mapped netlist 211 that is technologydependent having cells specifically selected to satisfy the constraints205. This gate level netlist 211 consists at this point of "missionmode" circuitry.

At block 212 of FIG. 1, DFT process 213 performs a particular testinsertion process (here a scan) to implement testability cells or "testmode" cells into the overall integrated circuit design. In this process213, memory cells of the mapped netlist 211 are replaced with memorycells that are specially designed to apply and observe test vectors orpatterns to and from portions of the integrated circuit. In oneparticular DFT process, these memory cells specially designed for testare called scannable memory cells. The test vector patterns can bederived from combinational or sequential automatic test patterngeneration (ATPG) processes depending on whether or not a full orpartial scan is performed by the scan insertion process 213. Process 213also performs linking groups of scannable memory cells into scan chainsso that the test vectors can be cycled into and out of the integratedcircuit design. The output of the scan insertion process 213 is ascannable netlist 215 that contain., both mission and test modecircuitry.

A problem occurs in the prior art process of FIG. 1 in that the scaninsertion process 213 does not take into account its impact on themission mode design. Specifically, the addition of the testability cells(scannable cells), and interconnections there between (chainingresources), and the addition of other dedicated connections required foroperation of the scan chains (e.g., scan clock routing and scan enablesignal routing) can cause the overall design to violate one or more ofthe defined constraints 205.

Therefore, a second compile process 217 of FIG. 1 (full or incrementalcompile) is invoked by the prior art process 200 in order to moreeffectively optimize the scannable netlist 215 to the constraints 205.An incremental compile 217 does not process all existing structure as ina full compile, it only applies high level logical optimization to theunmapped portions of the design. Those unmapped portions are then mappedusing a technology dependent library. During a process iteration, anincremental compile 217 always processes to decrease the circuit cost.However, although this second compile process 217 is only an incrementalcompile process, it applies mapping optimizations iteratively on theentire scannable netlist 215. As a result, processing time to performthe second compile process 217 can be on the order of weeks givenconventional CAD technology and circuit complexity.

Alternatively, many prior systems utilize a full compile as the secondcompile process 217. The full compile process is similar to process 225in that the full compile process at 217 applies mapping and logicoptimizations to the entire design, not just the unmapped portions.

After the second compile process 217 of FIG. 1 completes, a scannablenetlist 219 is again generated that contains the testability cells butthat may or may not meet the original performance constraints 205.Therefore, at block 221, the prior art then performs a test to determineif the scannable netlist 219 meets the constraints 205. If the netlist219 meets the constraints, then at block 235, other circuit synthesisprocedures, continue until the integrated circuit design can befabricated onto a substrate and tested.

However, as is often the case, the addition of the testability cells bythe scan insertion process 213 does not allow the second compile process217 to meet constraints 205 without a design modification to theoriginal HDL program 201. In such case, the overall process 200 flowsfrom block 221 back to the HDL 201 where the architect modifies the HDLprogram 201 so that the addition of the testability cells and otherresources will eventually satisfy, when possible, the given constraints205 after the incremental compile step 217 is again executed.

The prior art process 200 of FIG. 1 has several disadvantages. It isdisadvantageous to execute a second substantial compile process 217 inan attempt to match the testability cells and linking resources to thegiven set of constraints. Although this process can be an incrementalcompile step in that much of the gate level connections are not removed,mapping optimization portions of this compile process still operate inan iterative fashion over the entire design. The addition of this secondcompile process, using conventional technology, delays the overallintegrated circuit synthesis process by as much as one to two weeks.Even after this long delay, there are no guarantees that the incrementalcompile process 217 will generate a scannable netlist satisfying theconstraints 205. In this case, a time consuming task of returning to theHDL for redesign is required. This process involves the chip architectdesigners once more and, therefore, it is unclear under the prior artsystem when a designer can sign off on his or her work in the designprocess.

Another problem faced by prior art designs involving the introduction ofscan cells for testability while maintaining optimization constraints(e.g., timing and area constraints) is that the timing and areaconstraints cannot be met in some designs if the entire design is scanreplaced. This is true no matter how many conventional compile processesare executed after the scan insertion block. Therefore, it would bedesirable to determine a set of sequential cells that can be scanreplaced to just meet the timing and area constraints while offeringsignificant testability for the design. What is needed is a system thateffectively determines a set of sequential cells within a design thatcan be scan replaced while satisfying given timing and area constraintsof the design. The present invention provides this functionality.Further, what is needed is a system that can perform the above based oniterations through determined critical paths of the design. The presentinvention additionally provides this functionality.

Accordingly, the present invention advantageously provides a system foreffectively determining the amount of sequential cells within a designthat can be scan replaced while satisfying given timing and areaconstraints of the design and still offering significant testability forthe design. It is an object of the present invention to provide theabove within a selected set of scan cells that attempts to offer a highdegree of testability given the timing and area constraints to besatisfied. It is an object of the present invention to provide asubtractive system for performing the above wherein a fully scanreplaced netlist (that violates timing and area constraints) is inputand selected cells are unscanned until the timing and area constraintsare met. It is another object of the present invention to provide anadditive system wherein an unscanned netlist is received, and using acell based or a critical path based system, cells are scanned that donot make the timing of the original system any worse than originallysubmitted until a significant number of sequential cells are scanreplaced or area constraints are violated. These and other objects ofthe present invention not specifically recited above will become clearwithin discussion of the present invention herein.

SUMMARY OF THE INVENTION

A computer implemented process and system are described for effectivelydetermining a set of sequential cells within a integrated circuit designthat can be scan replaced (e.g. for design for test applications) tooffer significant testability while still maintaining specified timingand area constraints that are applicable to the design. The novel systemselects sequential cells of the set for scan replacement that offer besttestability contribution while not selecting sequential cells for scanreplacement that do not offer much testability contribution and/or arepart of most critical paths within the design.

The novel system is composed of a subtractive method and an additivemethod that individually operate on different netlist types. Thesubtractive method inputs a fully scan replaced netlist (e.g., thesequential cells are call scan replaced) that does not meet determinedoptimization (e.g., area and/or timing) constraints. The subtractiveprocess of the present invention can receive input, for example, fromthe output of a test ready compiler (TR) also of the present invention.The novel subtractive system unscans selected scannable cells until thetiming constraints are met if a timing critical flag is set by the user.Additional cells are unscanned if area constraints are violated.Selection for unscanning is based on a testability cell list (TCL) thatranks cells by their degree of testability contribution; those cellswith low degrees of testability are unscanned first. The additiveprocess of the present invention receives an unscanned netlist (the"original design") and scan replaces cells using the TCL until areaconstraints are violated or, if a timing-critical flag is set, until theperformance of the design having the scan replaced cells are worse thanthe original design. An unscanned netlist for the additive process canbe output from an conventional compiler or can be an imported netlist.The additive system iterates through the TCL list with the cellsoffering the most contribution for testability scan replaced first.Cells on critical paths, or subcritical paths that become critical whenthe cells are replaced, are not replaced if the user has asserted atiming-critical flag.

Specifically, embodiments of the present invention include, in acomputer system having a processor coupled to a bus and a memory coupledto the bus, a computer implemented subtractive method of generating anetlist having scannable sequential cells and satisfying determinedoptimization constraints (e.g., timing and area constraints), the methodcomprising the computer implemented steps of: receiving a ranked listordering sequential cells by their contribution to testability;receiving a fully scan replaced input netlist including scannablesequential cells, the input netlist not satisfying one of the determinedoptimization constraints (e.g., area and/or timing); if timingconstraints are violated and the user has asserted a timing-criticalflag, determining a set of critical paths within the input netlist byperforming a timing analysis on the input netlist and selecting aselected critical path of the set of critical paths and identifying afirst and a second sequential cell located on either end of the selectedcritical path and if both sequential cells are scannable, determining,using the ranked list, which sequential cell of the selected criticalpath contributes least to testability and unscanning that sequentialcell; repeating the above while critical paths with scannable sequentialcells exist within the set of critical paths; and provided areaconstraints are violated, continue unscanning scannable cells thatcontribute least to testability until either area constraints are met oruntil there are no more scannable cells in the netlist.

Embodiments further include the above and wherein if only one sequentialcell of the first and second sequential cells is scan replaced,unscanning it regardless of the ranked list. Embodiments further includethe above and wherein the input netlist includes a loopback connectionassociated with each scannable sequential cell. Embodiments of thepresent invention include the above and wherein the ranked listcomprises a list of sequential cells, each sequential cell having a ranknumber identifying its relative contribution to testability, and whereinsequential cells are determined to contribute least to testability bytheir rank number within the ranked list. The present invention alsoincludes a computer system implemented in accordance with the above.

Embodiments of the present invention also include, in a computer system,an additive method of generating a netlist having scannable sequentialcells and satisfying determined optimization constraints (e.g., timingand area constraints), the method comprising the computer implementedsteps of: (a) accessing a ranked list ordering sequential cells by theircontribution to testability; (b) accessing an unscanned input netlistincluding unscanned sequential cells; (c) selecting a selected unscannedsequential cell from the ranked list starting from an end of the rankedlist that identifies sequential cells having high contributions totestability; (d) scanning the selected unscanned sequential cell,provided the step of scanning does not worsen timing characteristics orviolate timing constraints of the input netlist, wherein the step (d)further comprises the steps of: (1) copying the input netlist togenerate an input netlist copy; (2) scanning the selected unscannedsequential cell within the input netlist copy; (4) determining the worstcritical path of the copy of the input netlist; (5) summing he areas ofeach logic and routing element of the input netlist copy to determine anarea of the input netlist copy; (6) scanning the selected unscannedsequential cell within the input netlist provided the worst criticalpath of the input netlist copy is not worse than the worst critical pathof the input netlist and provided the area of the input netlist copydoes not violate the area constraints; (e) selecting a next selectedunscanned sequential cell from the ranked list; and (f) repeating steps(d)-(e) for each unscanned sequential cell within the ranked list. Thepresent invention also includes a computer system implemented inaccordance with the above.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flow diagram illustrating a prior art process for logicsynthesis with design for test implementations.

FIG. 2 is an exemplary computer system used in accordance with thepresent invention as a CAD system for design synthesis.

FIG. 3A is a diagram of a logic model including combinational logic andmemory cells used by the present invention to represent the design of asynthesized integrated circuit

FIG. 3B illustrates replacement performed by the present invention fromnon-scan memory cells specified by or inferred from the original HDLdescription into scannable memory cells used for DFT implementation.

FIG. 4 illustrates a procedure of the present invention (Design RuleChecker) that determines valid scan chains and marks as violated anyscannable memory cell not part of a valid scan chain.

FIG. 5A illustrates an exemplary edge triggered D flip-flop used in theHDL description in an embodiment of the present invention.

FIG. 5B illustrates an exemplary scannable memory cell version of thememory cell presented in FIG. 5A according to an embodiment of thepresent invention.

FIG. 5C illustrates an exemplary edge triggered memory cell used in theHDL description that also contains other combinational logic in additionto the memory cell circuitry.

FIG. 5D illustrates an exemplary scannable memory cell version of thememory cell presented in FIG. 5C according to an embodiment of thepresent invention including other combinational logic present.

FIG. 6A illustrates an exemplary circuit implementation of a loopbackline added by the test ready (TR) compiler of the present invention tosimulate a linked scan chain where the loopback is taken from the output(Q).

FIG. 6B illustrates an exemplary circuit implementation of a loopbackline added by the TR compiler of the present invention to simulate alinked scan chain where the loopback is taken from the inverted output(Q).

FIG. 7 illustrates a task performed by the modified scan insertionprocess of the present invention where the scan chain is buffered inlinks that span more than one module allowing the loopback lines toaccurately simulate this condition.

FIG. 8 is an overall flow diagram of the embodiments of the presentinvention starting from HDL description to the generation of testvectors having a scannable gate level netlist.

FIG. 9 is a flow diagram illustrating processes of the test ready (TR)compiler of the present invention.

FIG. 10 is an overall flow diagram illustrating processes of themodified scan insertion process of the present invention including theconstraint driven compile process.

FIG. 11A and FIG. 11B represent a flow diagram illustrating processes ofthe constraint driven compile process of the modified scan insertionprocess of the present invention.

FIG. 12 is a flow diagram illustrating processes of the size designprocess of the constraint driven compile process of the presentinvention.

FIG. 13A and FIG. 13B illustrate an exemplary circuit transitionperformed by the phasing subprocess of the size design process shown inFIG. 12.

FIG. 14A and FIG. 14B illustrate an exemplary circuit transitionperformed by the buffering subprocess of the size design process shownin FIG. 12.

FIG. 15A and FIG. 15B illustrate an exemplary circuit transitionperformed by the downsizing subprocess of the size design process shownin FIG. 12.

FIG. 16A and FIG. 16B illustrate an exemplary circuit transitionperformed by the isolation subprocess of the size design process shownin FIG. 12.

FIG. 17A and FIG. 17B illustrate an exemplary circuit transitionperformed by the offloading subprocess of the size design process shownin FIG. 12.

FIG. 18A and FIG. 18B illustrate an exemplary circuit transitionperformed by the balancing subprocess of the size design process shownin FIG. 12.

FIG. 19A and FIG. 19B illustrate an exemplary circuit transitionperformed by the splitting subprocess of the size design process shownin FIG. 12.

FIG. 20 illustrates the present invention test ready (TR) compiler andmodified scan insertion process within a hierarchical designillustrating the practical nature of the present invention on a chiplevel netlist.

FIG. 21A illustrates a top level flow diagram of the novel subtractive(e.g., partial) scan procedure of the present invention.

FIG. 21B illustrates a top level flow diagram of the novel additive(e.g., near full) scan procedure of the present invention.

FIG. 22A is a flow diagram of steps of the novel subtractive system ofthe present invention.

FIG. 22B illustrates a sample critical path (bounded by rankedsequential cells) involved in the processing of the novel subtractivesystem of the present invention.

FIG. 23A is a flow diagram of initial steps of the novel additive systemof the present invention.

FIG. 23B is a flow diagram of steps of the novel additive system of thepresent invention.

FIG. 24 is a flow diagram illustrating an embodiment of the presentinvention using near full scan as a front end for the partial unscanprocess.

DETAILED DESCRIPTION OF THE INVENTION

In the following detailed description of the present invention numerousspecific details are set forth in order to provide a thoroughunderstanding of the present invention. However, it will be obvious toone skilled in the art that the present invention may be practicedwithout these specific details. In other instances well known processes,methods, procedures, components, and circuits have not been described indetail as not to unnecessarily obscure aspects of the present invention.

Some portions of the detailed descriptions which follow are presented interms of procedures, processes, and symbolic representations ofoperations on data bits within a computer memory. These proceduredescriptions and representations are the means used by those skilled inthe data processing arts to most effectively convey the substance oftheir work to others skilled in the art. A procedure, process, or logicblock is here, and generally, conceived to be a self-consistent sequenceof steps leading to a desired result. The steps are those requiringphysical manipulations of physical quantities. Usually, though notnecessarily, these quantities take the form of electrical or magneticsignals capable of being stored, transferred, combined, compared, andotherwise manipulated. It has proven convenient at times, principallyfor reasons of common usage, to refer to these signals as bits, values,elements, symbols, characters, terms, numbers, or the like.

It should be borne in mind, however, that all of these and similar termsare to be associated with the appropriate physical quantities and aremerely convenient labels applied to these quantities. Unlessspecifically stated otherwise as apparent from the followingdiscussions, it is appreciated that throughout the present invention,discussions utilizing terms such as "processing" or "computing" or"calculating" or "'dettmining" or "displaying" or executing a procedureor the like, refer to the action and proc sses of a computer system, orsimilar electronic computing device, that manipulates and transformsdata represented as physical (electronic) quantities within the computersystem's registers and memories into other data similarly represented asphysical quantities within the computer system memories or registers orother such information storage, transmission or display devices.

Specific aspects of the present invention are operable within aprogrammed computer aided design (CAD) system. A CAD system operable toimplement the elements of the present invention is shown in FIG. 2. Ingeneral, the CAD system of the present invention includes a computersystem 112 which includes a bus 100 for communicating informationincluding address, data, and control signals, a central processor 101coupled with the bus 100 for processing information and instructions, arandom access memory 102 coupled with the bus 100 for storinginformation and instructions for the central processor 101, a read onlymemory 103 coupled with the bus 100 for storing static information andinstructions for the processor 101, a data storage device 104 such as amagnetic or optical disk and disk drive coupled with the bus 100 forstoring information and instructions, a display device 105 coupled tothe bus 100 for displaying information to the computer user, analphanumeric input device 106 including alphanumeric and function keyscoupled to the bus 100 for communicating information and commandselections to the central processor 101, a cursor control device 107coupled to the bus for communicating user input information and commandselections to the central processor 101, and a signal generating device108 coupled to the bus 100 for communicating signals that are input andoutput from the system 112.

Program instructions executed by the CAD system can be stored in RAM102, ROM 103, or in the storage device 104 and when executed in a groupcan be referred to as logic blocks or procedures. It is appreciated thatdata produced at the various logic synthesis stages of the presentinvention, including representations of the different levels ofabstraction of the integrated circuit design, can also be stored in RAM102, ROM 103 or the storage device 104 as shown in FIG. 2.

The display device 105 of FIG. 2 utilized with the computer system 112of the present invention may be a liquid crystal device, cathode raytube, or other display device suitable for creating graphic images andalphanumeric characters recognizable to the user. The cursor controldevice 107 allows the computer user to dynamically signal the twodimensional movement of a visible pointer on a display screen of thedisplay device 105. Many implementations of the cursor control deviceare known in the art including a trackball, mouse, joystick or specialkeys on the alphanumeric input device 105 capable of signaling movementof a given direction or manner of displacement.

FIG. 3A illustrates a circuit model 300 utilized by the presentinvention to represent a logic unit of an integrated circuit. The model300 includes a memory cell block 301 that outputs signals over line 311to a combinational logic block 305. Combinational logic 305 also outputssignals over line 313 to drive inputs of the memory cells 301. Memory301 receives a primary input signal 307 which typically originates offchip. Combinational logic block 305 also receives a primary input signal309 which typically originates off chip. Memory block 301 generates aprimary output signal 315 that goes off chip and combinational logicblock 305 also generates a primary output signal 317 that goes off chip.According to model 300, memory block 301 is composed either of edgesensitive clocked memory cells (e.g., flip-flops) or level sensitivememory cells (e.g., latches or registers) or can be composed of othercell types. A variety of memory modes or styles, including either of thememory modes described above, are acceptable within the presentinvention. For purposes of explanation, the memory cell style adoptedwithin discussions herein is the edge triggered flip-flop but thepresent invention is equally adapted for other styles such as levelsensitive modes, e.g., Level Sensitive Scan Design (LSSD) modes, clockedscan modes, clocked LSSD modes, and auxiliary clocked LSSD modes.

As shown by FIG. 3A, portions of combinational logic 305 that aredirectly stimulated by primary input 309 and that are directly coupledby its output to primary line 317, can be readily tested for faults(faulty circuitry) by direct application ol test vectors (predetermineddata patterns) over line 309 and by direct observation of the outputover line 317. However, this represents only a small percentage of thelogic of block 305. In typical applications, most of the logic gateswithin block 305 receive their inputs from memory cells within 301 andforward their outputs to other memory cells within 301. In order toaccurately test the combinational logic within block 305, DFT processesprovide a mechanism for isolating different logic groups within block305 by (1) passing test vectors into the memory cells 301, (2) allowingthe stimulated logic groups to store the product information inpredetermined memory cells, and then (3) recalling the output fromstimulated portions of block 305 from memory 301.

In effect, test vectors are scanned into the memory cells during testmode, the combinational logic is operated and its output is thencaptured in the memory cells to be scanned out.

FIG. 3B illustrates a portion of the structure of the mapped memorycells within unit 301 as typically specified or inferred in an HDLcircuit description without the addition of test circuitry; these cellsare called non-scan cells. Unit 301 consists of a plurality ofindividual non-scan memory cells. In this example, five exemplary Dflip-flops are illustrated 307a-307e. Each non-scan memory cell307a-307e receives an input 306a-306e from either a primary input offchip, from the combinational logic block 305, or from another memorycell within block 301. These non-scan memory cells also have outputs309a-309e that typically drive combinational logic gates or other memorycells or can be a primary output that goes off chip.

Within one embodiment, a scan replacement process of the presentinvention replaces the non-scan memory cells 307a-e of unit 301 withscannable memory cells 320a-e (FIG. 3B) that are provided for DFTfunctionality and links them together to form a scan chain. A scannablememory cell is also called a test memory cell. Memory cells are alsocalled sequential cells. The result is shown in memory unit 320 of FIG.3B which contains scannable memory cells 320a-e in a scan chainconfiguration. This configuration 320 accommodates testability for stuckat faults and allows loading of test vectors into the cells and scanningof certain data out of the cells in response to the application of thetest vectors.

Herein, a non-scan or unscan memory cell indicates a memory cell thatdoes not support scanning, e.g., cell elements 307a-e of FIG. 3B. A scancell or scannable cell indicates a memory cell that supports scanning,e.g., cells 320a-e of FIG. 3B. An unscannable or nonscannable memorycell indicates a memory cell that is violated or otherwise userindicated as not to be scanned.

The scannable memory cells 320a-e in this example consist of multiplexedinput D flip-flops and are linked together in chains, as shown, to formshift register configurations. Each particular scannable memory cell canbe analogous to each other cell, so one cell 320a is described herein.Scannable cell 320a contains a memory cell 321a and a multiplexer 323.The D input of cell 321 a is coupled to the output of a multiplexer 323which has a select line input 325, called the scan enable or SE line.The data inputs to the mux 323 are (1) an I input 327 analogous to input306a from unit 301 and (2) an SI shift input 329 which originates from aprevious scannable memory cell or from a primary input provide cell 320ais the first cell in a scan chain. It is appreciated that the output ofthe mission mode logic 305a is typically routed (e.g., by a designer) tothe I inputs 327 of the scannable memory cells for observation or to aprimary output. The output 331 of cell 321a is routed to mission modecircuitry 305a of the combinational logic 305 and is also routed toanother scannable memory cell (e.g., 320b) or to a primary output, ifthis cell is the last cell of a scan chain. It is appreciated that the Qor /Q output pin can be utilized in the chaining configuration.

In this configuration, scannable memory cells 320a-e comprise anexemplary scan chain because the output of one sequential cell iscoupled to the input of an adjacent sequential cell.

The circuitry shown in unit 320 has two modes, mission and test. Inmission mode, the SE lines are not asserted and data is selected by themux from the I inputs. In test mode, the SE inputs of each scannablememory cell 320a-e are asserted such that the shift inputs SI are activeand a test vector (string of bits) can be inserted into the integratedcircuit through a primary input and shifted through the scannable memorycells 320a-e and then applied to the appropriate logic 305a-e coupledthereto. The product information generated by the tested logic (305a-ein this example) is then stored in a scan chain and shifted out to aprimary output for observation. In this manner, the testability of thecombinational logic block 305 is greatly enhanced as logic essentiallyburied deep within a pipeline of the integrated circuit (e.g., coupledonly to scannable memory cells) can be directly isolated and tested byshifting test vectors into a scannable memory cell chain and thenshifting the product back out to a scannable memory cell chain forobservation.

Not all scannable memory cells can be included within a scan chain for avariety of reasons. Some memory cells do not offer a scan-in ability ordo not offer a scan-out ability due to logic considerations. Further,some cells cannot capture data at their data in port. Also, some memorycells directly control the reset, clock or input line of another memorycell in a scan chain. In this case, when the test vector is loaded intothe scan chain, portions of the test vector can be altered, e.g., wherethe test vector data loaded into a first scan cell is used as a set orreset signal applied to a second scan cell thereby overwriting the testdata in the second scan cell of the scan chain. Lastly, some memorycells, after the scan chain is constructed, do not offer scannabilityfor various logic reasons, e.g., the clock input is not clockedproperly.

Memory cells that cannot be placed into a scan chain are violated by thepresent invention and removed from its scan chain. FIG. 4 illustrates anexemplary violated memory cell. A sample scan chain as output fromconstraint driven scan insertion block 645 of the present invention(FIG. 8) is shown including scannable memory cells 320a-320c of memoryunit 301. Block 645 is described further below. Assume that logic block402 of FIG. 4 is a scannable memory cell and directly controls eitherthe clock input or the set/reset input of memory cell 321d. Becausememory cell 321d is directly controlled by another sequential element,this memory cell 321d is violated (e.g., eliminated) from the scan chainso that only cells 321a-c remain in the chain. As shown in FIG. 4, theviolated cell 321d is not scan replaced. Cell 321c can be the lastmemory cell of the scan chain or can be coupled to another memory cell(not shown). In either case, the output of cell 321c is not coupled toviolated cell 321d.

Although not shown, it is appreciated that combination logic 305 outputsto primary outputs (off chip) or to scannable memory cells. The presentinvention utilizes a well known procedure, Design Rule Checking (DRC),to perform the above analysis to check for violated scan cells. Nomeaningful observations are possible for circuitry coupled to violatedmemory cells.

An unfortunate result of violating memory cell 321d is that thecombinational logic block 305' is no longer able to be stimulated by theDFT circuitry. It is appreciated that if a memory cell is violated thathappens to be the observation cell for block 305', instead of being thestimulus cell, this condition also results in block 305' falling out ofthe DFT circuitry because its results cannot be observed. As discussedfurther below, the design rule checking (DRC) process of the presentinvention determines which scans cells to violate of a constructed scanchain and also performs other functions. A violated scan cell is calledan unscannable memory cell.

As discussed further below, the computer implemented TR compiler 625(FIG. 8) of the present invention performs two important functions inpredicting the impact of the DFT circuitry on the mission modecircuitry. First, the TR compiler 625 of the present invention replacesthe non-scan cells (e.g., HDL specified or inferred memory cells) withscannable memory cells. An HDL specified or inferred memory cell is anon-scan memory cell. This process includes an equivalence test in orderto provide the proper scannable memory cell for replacement. Secondly,in order to simulate the presence of the links that connect thescannable memory cells into chains, the TR compiler 625 of the presentinvention provides loopback circuits that couple the output (either Q or/Q) of a given scannable memory cell back to the scan input (SI) of thesame memory cell. This loopback connection provides the TR compiler 625of the present invention with enough information to determine teelectrical impact of the DFT circuitry on its design without knowing theactual routing of the chain, which will be determined by subsequentlydriven processes. In this way, the TR compiler 625 of the presentinvention can generate its design with enough information regarding theDFT circuitry so that the circuit constraints associated with the HDLspecification can more likely be satisfied when the DFT circuitry iscompleted. The loopback connection is discussed in more depth furtherbelow.

FIG. 5A illustrates a sample non-scan memory cell. The exemplary cell415 shown in FIG. 5A is an edge triggered D flip-flop, however, the preent invention can readily operate with a level sensitive latch orregister (e.g., LSSD mode). Once the TR compiler 625 of the presentinvention performs its mapping functions with a particular technology,the TR compiler 625 replaces the HDL memory cell 415 with an equivalentscannable (or "test") memory cell 425 shown in FIG. 5B. As shown in FIG.5B and discussed previously, a scannable edge triggered memory cell 425contains a multiplexed input with mux 430 driving the D input. The mux430 receives two inputs, an SI input 437, which is a scan input, andanother input, I, 435, which is a data input. A scan enable select lineSE 439 provides the selection control for data or scan input. Using line439, scan input (SI) is selected during test mode and data input (I) isselected during the mission mode.

As is often the case, memory cells contain additional ciruitry asidefrom the circuitry that performs their sequential memory functions. FIG.5C illustrates one exemplary non-scan memory cell 417 that also containscombinational logic 450 in addition to its circuitry to perform itsmemory functions. The TR compiler 625 of the present invention willaccount for this combinational logic 450 when determining an equivalentscannable memory cell for replacement. This determination process iscalled scan equivalence and is described in more detail below. FIG. 5Dillustrates an equivalent scannable (or "test") memory cell 427 thatcontains the combinational logic 450 that was also present in thenon-scan memory cell 417 of FIG. 5C as well as the multiplexed input toprovide for scan chaining. In this fashion, the scannable memory cell427 is equivalent to the non-scan memory cell 417 and will be used bythe TR compiler 625 of the present invention to replace cell 417.

After the sequential circuitry (e.g., the memory cells) have beenreplaced with scannable memory cells, the present invention TR compiler625 provides loopback connections to effectively simulate the scannablememory cells in a scan chain configuration. FIG. 6A illustrates aloopback connection 440 of the present invention from the Q output ofscannable memory cell 425 to the scan input SI of mux 430. In operationfor DFT purposes, while memory cell 425 will not be so coupled, theloopback connection 440 will accurately simulate the electricalcharacteristics seen by the Q output port of cell 425 because, in a scanchain, this port will likely be coupled to another scannable memory cellhaving similar input characteristics as mux 430 and since the compilewire model uses a unit delay model for this connection. Similarly, theloopback connection 440 accurately simulates the electricalcharacteristics seen by the mux 430 of cell 425 because in its scanchain mux 430 will likely be coupled to another scannable memory cellhaving similar output characteristics as the Q output of cell 425 andsince the compile wire model does not distinguish between inter andintra module connections. As shown in FIG. 6A, the I input of mux 430can be coupled to other combinational logic 305' over line 435 and the Qand /Q outputs of cell 425 can be couple I to combinational logic 305.

During compilation, the TR compiler 625 of the present invention willtherefore accurately operate on this scannable memory cell 425 with theloopback connection so that constraints will more likely be met afterthe DFT circuitry is complete.

It is appreciated that the loopback connection 440 employed by the TRcompiler 625 of the present invention can also originate from the /Qoutput of cell 425 as shown in FIG. 6B. Here, the loopback connection440 is coupled from the /Q port of cell 425 to the SI input of mux 430.The selection of Q or /Q will depend on information determined by the TRcompiler of the present invention. This information can originate fromlibrary attribute information (e.g., indication of a test scan out)associated with the scan cell. It is appreciated that the selection of Qor /Q is not vital to the present invention because logically eitheroutput of cell 425 can be utilized for the scan chain. What isimportant, however, is that at least one output Q or /Q be selected witha loopback connection 440 for optimization. As shown in FIG. 6B, the Iinput of mux 430 can be coupled to other combinational logic 305' overline 435 and the Q and /Q outputs of cell 425 can be coupled tocombinational logic 305.

FIG. 7 illustrates a circuit modification performed by the presentinvention after the TR compiler 625 is run. Since scan chains can spanbetween module, this modification occurs when a link of a scan chainspans more than one module. A module represents a design portion thatcan be separated by some area within the integrated circuit. The TRcompiler 625 applies a unit delay associated with each loop backconnection 440 which will accurately characterize most scan links thatare intra-module but will not accurately model scan links that areinter-module. In the latter case, a gating element is added to satisfythe assumption made by the TR compiler 625.

In FIG. 7, the link spans between two memory cells, one in module A(cell 425) and another in module B (not shown). In these cases, themodified scan insertion process (to be discussed below) of the presentinvention places a gate 510 (e.g., an AND gate) in the link between themodules. One input of the gate 510 is coupled to scan enable (SE) line439. With the gate 510 inserted in the scan connection; this physicallydisables the scan path when SE 439 is not asserted. All the scan pathsees electrically is the capacitance of the gate input. Within the wiremodel of the TR compiler 625, a unit delay is assumed for this scanpath. Using the configuration of FIG. 7, the loopback connection 440 ofthe present invention effectively simulates the mission modecharacteristics of this link because the gate 510 hides the missioneffects of the long scan link between modules A and B as seen by the Qoutput of cell 425 and is consistent with the assumptions made by the TRcompiler 625. Thus, the addition of gate 510 accurately models the unitdelay assumption made in the TR compiler 625 for the scan link.

FIG. 8 illustrates an overall flow diagram of a synthesis process 600 inaccordance with embodiments of the present invention and its logicblocks are implemented within the computer controlled CAD systemdescribed above. Flow 600 includes the TR compiler (test ready compile)625 of the present invention and the modified scan insertion procedure645 (constraint driven scan insertion) of the present invention. Flow600 also includes a generic HDL compiler 615 which will be describedbelow but is different from the TR compiler 625 of the presentinvention. Generic compiler 615 is analogous to generic compiler 203.

The flow 600 receives an HDL description 605 of an integrated circuitlayout along with a set of design constraints 610 (including design rulelimitations, and performance limitations, such as area, timing, power,etc.) that are pertinent to the circuit described by the HDL description605. Design rules as used herein refer to maximum fanout, maximum signaltransition time, and maximum node capacitance. The HDL description 605can be of a number of different formats, such as VHDL or Verilog and canalso re-present an entire IC design, but typically represents a moduleof the overall IC design. The HDL description 605 can be stored in acomputer memory unit (e.g., unit 102 or 104) and is fed into an optionalgeneric compiler logic block 615 that is well known in the art. Thisgeneric compiler 615 transforms the HDL description 605 into atechnology independent netlist 620 that is more readily recognized bythe TR compiler 625. Block 615 performs a process on the input netlist620 to generate a technology independent or generic netlist of the IClayout by interfacing with a synthetic library or "designware" library.Technology independent netlist 620 is composed of logical primitives andoperators of the IC layout but the components described therein containno structure. The generic compile process 615 and resultant output 620are well known in the art.

The netlist 620 of FIG. 8 generated by block 615 is input to the TRcompiler logic block 625 of the present invention. The TR compiler 625is described in more detail in FIG. 9 which illustrates the particularstages within the TR compiler 625 where the sequential circuits arereplaced and where the loopback connections are inserted. The TRcompiler 625 in FIG. 8 performs a process on the generic netlist thatinterfaces with a technology specific library, "cell" library, so that amapped netlist can be generated that includes specific structuralinformation about the components used in the compiled design. Inaddition, the TR compiler 625 of the present invention: (1) replaces theHDL memory cells specified in or inferred from the netlist 620 (e.g.,non-scan cells) with scannable memory cells; and (2) inserts loopbackconnections in each scannable memory cell added. FIG. 5A, FIG. 5B, FIG.5C and FIG. 5D illustrate the memory cell replacements performed by thepresent invent ion while FIG. 6A and FIG. 6B illustrate the addition ofthe loopback connections performed by the present invention TR compiler625.

Referring to FIG. 8, by performing the above processes, the TR compiler625 of the present invention is able to better optimize for the eventualconstruction and completion of the DFT circuitry. In this way, it ismore likely that the resultant test circuit design will meet constraints610. The output of the TR compiler 625 of the present invention is anon-scannable technology dependent netlist 630 that comprises scannablememory cells with loopback connections 440. Although called"nonscannable" because ol the loopback connections, netlist 630 isnevertheless a fully scan replaced netlist in that the TR compiler 625replaced each HDL specified or inferred sequential cell (e.g., non-scancell) by ar equivalent scannable cell. This netlist 630 can be stored ina memory unit of the computer system such as RAM 102 or the storagedevice 104 (FIG. 2). The overall non-scannable net list 630 is optimizedto constraints 610 and is a gate level mapped netlist and therefore istechnology specific.

The non-scannable netlist 630 of FIG. 8 is then input to a DFT designrule checker logic block 635 ("DRC"). Any of a number of well known DRCprocesses can operate within the present invention including a DRC asdescribed by E. B. Pitty, D. Martin, and H. T. Ma in a paper entitled "ASimulation-Based Protocol-Driven Scan Test Design Rule Checker,"published in IEEE International Test Conference, page 999, paper 40.2(1994). The DRC 635 checks the scannable memory cells in the netlist 630to determine which cells should be violated according to the discussionherein with respect to FIG. 4. Processing flows to DRC block 635' wherethose scannable memory cells that are determined not to be part of ascan chain by DRC block 635 are marked as violated by the DRC at logicblock 635'. Cells marked as violated will be unscanned by the modifiedscan insertion process 645. The act of unscanning replaces the violatedmemory cells with an equivalent non-scan memory cell. When unscanned,the loopback connection 440 associated with a violated cell is alsodestroyed. When violated, the memory cell is referred to as unscannable.

Referring to FIG. 8, the output of the DRC block 635' is a non-scannablenetlist (like 630) but with the violated memory cells marked so as to beunscanned. The output of the DRC 635' is input to a modified scaninsertion and routing logic block 645 of the present invention which isalso called "constraint driven" scan insertion. This procedure 645 isfurther described in FIG. 10. Referring to FIG. 8, the modified scaninsertion and routing procedure 645 of the present invention breaks theloopback connections 440 in the scannable memory cells of netlist 630.Also, for memory cells that are not violated but remain non-scanned(e.g., the cells are still HDL specified sequential cells because theywere output from a prior art compiler), process 645 replaces thenon-scan memory cells with scannable memory cells. It is appreciatedthat the TR compiler 625 of the present invention does not (output anynon-scan yet not violated memory cells to logic block 645. Logic block645 also unscans any memory cell marked as violated.

Logic block 645 of FIG. 8 allocates resources to construct the scanchains between the scannable memory cells. Well known methods andprocedures for linking scannable memory cells can be used to create thescan chains for DFT. The present invention modified scan insertionprocess 645 then performs a reduced set of compiler procedures(constraint driven procedures) that are described with reference to FIG.11A and FIG. 11B in order to efficiently and effectively optimize theresultant circuitry to the original constraints 610. These compilerroutines of logic block 645 are different from the incremental compilestep of the prior art in that there are different levels of userselectable effort applied to the compilation process, some of which areapplied only to the circuitry added for DFT implementation. Theresulting process is more efficient and requires substantially less timeto process while yielding excellent results in optimizing the overallcircuit layout to the constraints.

The output of logic block 645 of FIG. 8 is a mapped scannable netlist650 at the gate level. This netlist 650 contains valid scan chains ofscannable memory cells used to receive test vectors. The netlist 650 isincorporated, along with other module scannable netlists, into ascannable system netlist 653 by system composition block 652 The systemlevel scannable netlist 653 is then input to an automatic test patterngeneration procedure (ATPG) 655. ATPG procedures are well known in theart and are used to generate the test vectors 660. If the netlist 650contains a full scan then a combinational ATPG process can be used atlogic block 655 to generate the test vectors. If the netlist 650 is theresult of a partial scan, in which only some of the memory cells arescanned while others are not, then a sequential ATPG process can beexecuted at block 655 to generate the test vectors. Both thecombinational and sequential ATPG processes are well known. Also atblock 655, ATPG formatting is performed in which the test vectorsgenerated by the ATPG processes are altered or modified so that theywill operate with a particular specified set of test equipment thataccepts a particular format as input.

The test vectors 660 generated from block 655 are therefore customizedto a particular test equipment protocol. These test vectors 660 can thenbe used along with the particular test equipment to load test vectorsinto the result IC chip to check for faults within the chip.

FIG. 8 illustrates two separate embodiments of the present invention.The first embodiment is the sequential element replacement and theaddition of the loopback connections in the TR compiler 625 of thepresent invention. The second embodiment represents the addition of thereduced set of compile steps (e.g., constraint driven) performed by themodified scan insertion process 645 of the present invention. Both ofthese processes are described in more detail to follow.

TEST READY COMPILER

FIG. 9 illustrates processes of the computer system 112 implemented TRcompiler 625 of the present invention. As will be discussed below,several of the processes (e.g., logic blocks) shown in FIG. 9 aredescribed for completeness in terms of a particular embodiment of thepresent invention. Several of these processes are optional and, withoutdeparting from the scope of the present invention, can be omitted orreplaced with well known procedures. The TR compiler 625 of the presentinvention starts at logic block 710 (optional) where finite statemachine (FSM) optimization is performed. At logic block 710, statemachine encoding or descriptions are translated into recognizable orstandardized HDL primitives. At logic block 710, the compiler 625processes each FSM in the design and optimizes their state encoding andthen converts them to Boolean equations and technology-independentflip-flops.

At optional logic block 715, subdesigns in the HDL are ungrouped toallow merging of HDL from FSM designs with other HDL portions. Bydefault, subdesigns are compiled hierarchically, preserving their designboundaries. However, this optional logic block 715 will ungroup thesubdesigns into their parent designs before being compiled by otherprocedures. At logic block 720, the present invention TR compiler 625performs high level optimizations including resource allocation andsharing depending on timing and area considerations. Additionaloptimizations such as arithmetic optimization and the sharing of commonsubexpressions are also performed at logic block 720. In resourcesharing, when possible, particular operators such as adders, registers,etc. are shared between different expressions.

Referring to FIG. 9, at logic block 725, synthetic libraryimplementation selection is performed. In this process, genericoperators are mapped to architectural representations (implementations).Generic operators that can map into more than one architectural genericcomponent are assigned to a particular architectural element based onthe constraints 610. At block 725, a technology independent designwarelibrary can be used to select to proper architectural representation.The designware library is created by a designware developer.

The implementation used in block 725 is dependent on the definedconstraints 610 (FIG. 8) and the results of logic block 725 containtechnology independent generic cells. At logic block 730 of FIG. 9,sequential inference is performed by the TR compiler 625 of the presentinvention. At step 730, a particular memory style or mode can beselected, e.g., it can be determined whether to use edge triggeredflip-flops or level sensitive latches for the sequential elements, or touse another mode. At logic block 730, the selectedtechnology-independent memory elements are then inserted into thedesign.

At logic blocks 735 and 740, the present invention performs two optionaland technology independent processes to flatten and/or add structure tothe design. These processes are well known. At logic block 735, anoptional flattening step is performed where the logic is reduced inorder to eliminate intermediate variables and results. Duringflattening, intermediate variables and therefore intermediate logicstructure is removed from a design. While advantageous, flattening canhave an adverse effect on CPU 101 processing time and design area TableI below gives an example of a design before and after flattening.

                  TABLE I                                                         ______________________________________                                        Before              After                                                     ______________________________________                                        f0 = a t0           f0 = ab + ac                                              f1 = d + t0         f1 = b + c + d                                            f2 = t0' e          f2 = b' c' e                                              t0 = b + c                                                                    ______________________________________                                    

At logic block 740, the present invention performs an optional logicblock to structure the design. Structuring is an optimization step thatadds intermediate variables and logic structure to a design. Duringstructuring, the TR compiler 625 searches for sub-functions that can befactored out, and evaluates these factors based on the size of thefactor and number of times the factor appears in the design. Thesub-functions that most reduce the logic are turned into intermediatevariables and factored out of the design equations. Table II belowillustrates results of structuring a set of equations. In this example,the sub-function t0 is isolated as an intermediate variable and thenfactored out of the remainder of the design.

                  TABLE II                                                        ______________________________________                                        Before               After                                                    ______________________________________                                        f0 = ab + ac         f0 = a t0                                                f1 = b + c + d       f1 = d + t0                                              f2 = b' c' e         f2 = t0' e                                               t0 = b + c                                                                    ______________________________________                                    

After logic block 740, the TR compiler 625 of the present invention, atblock 750, translates the sequential elements that were inferred bylogic block 730. At logic block 750, the memory cells inferred by logicblock 730 are translated into technology dependent non-scan memory cellsof a particular target technology. As shown, the technology library 743is coupled to interface with logic block 750 to provide the cell libraryfor translation. At block 752, the present invention performs scan cellreplacement where the technology independent non-scan cells translatedby block 750 are then replaced with technology dependent scannablememory cells (e.g., the cell becomes "scan replaced"). The translationprocedure 750 performed by the present invention is done with knowledgeregarding the types of scannable cells available within the targettechnology library 743 so that technology dependent non-scan cellsinserted by process 750 can be replaced with equivalent scan cells byreplacement process 752.

The process of replacement (step 752) involves an equivalencedetermination because, as shown in FIG. 5C and FIG. 5D, the sequentialmemory cells can include additional circuitry that needs to be matchedduring the translation. To perform the equivalence, each component inthe technology library 743 contains an ASCII function identificationstring ("function identifier") that describes in a particular formatwhich functions are implemented in any particular device.

The technology dependent non-scan memory cells placed into the circuitat logic block 750 of FIG. 9 contain similar ASCII function IDs. Thepresent invention at logic block 752 utilizes the function IDs of thenon-scan memory cells to compare against function IDs of the scannablememory cell elements of the target technology library 743 to locatescannable memory cells with equivalent functionality as the non-scanmemory cell. In this way, functionally equivalent scannable memory cellscan be used to effectively replace the non-scan memory cells. Technologylibraries 743 include timing information (e.g., set up and hold timing)that is used by the present invention at block 752 to arbitrate betweenfunctionally equivalent scan cells for replacement. It is appreciatedthat TR compiler 625 can also use sequential mapping as an alternativeto function ID-based equivalence during replacement 752. In accordancewith the present invention, once the scannable memory cells are inplace, the resultant design represented in a netlist description can bestored in RAM 102 or storage device 104 of the computer system 112 orany media storage unit.

It is appreciated that in an alternate embodiment of the presentinvention, logic blocks 750 and 752 can be combined into one replacementstep wherein the HDL specified generic sequential cells are directlyreplaced by technology dependent scannable sequential cells.

At logic block 755 of FIG. 9, the TR compiler 625 of the presentinvention then adds the loopback connections 440 which connect an outputof each scannable memory cell to the scan input of the same cell tosimulate a link in a scan chain as shown in FIG. 6A and FIG. 6B. This isperformed for each scannable memory cell. In the case of D-flip flopcells, the loopback connection 440 can connect from the Q or /Q outputto the preferred scan input (SI). In accordance with the presentinvention, once the scannable memory cells have loopback connections 440installed, the resultant design represented as a netlist description canbe stored in RAM 102 or storage device 104 of the computer system 112 orany media storage unit.

At logic block 760, the TR compiler 625 then performs a second optionalflatting procedure analogous to logic block 735. The second optionalflattening procedure is used to perform flattening, if desired, givenknowledge of the electrical characteristics of the elements added byblocks 750, 752 and 755 of the present invention. This additionalknowledge provides a more refined flattening output. At logic block 765,the TR compiler 625 then performs a second optional structure addingprocedure analogous to logic block 740. The second optional structureadding procedure is used to add stricture, if desired, given knowledgeof the electrical characteristics of the elements added by blocks 750,752 and 755 of the present invention.

At logic block 770 of FIG. 9 an initial combinational mapping step isperformed by the present invention where the generic cells("architectural implementations") of the design are replaced withtechnology specific components and more specific timing information andarea usage can be obtained. Logic block 770 is linked with the selectedtechnology library 743 (also called a "cell library") defining structureand function of particular elements and components that can be used toreplace the generic cells of the design. During block 770, the firstfunctionally equivalent library cell is selected. At the completion ofblock 770, a technology dependent netlist is generated.

At logic block 775 of FIG. 9, the present invention TR compiler 625performs input/output pad insertion and optimization. I/O pad insertionis the process of adding I/O buffers to primary inputs and outputs of adesign. The characteristics of instantiated I/O pads can be predefined,such as voltage levels, current levels, pull up and pull down resistors,slew rate control, and so forth. I/O pad optimization modifies I/O padsthat were previously inserted to meet constraints 610. At logic block775, I/O pads can be optimized to include sizing and incorporation ofcore logic functionality into the pads. Any of a number of well knownprocedures can be used at this step.

At logic block 785 of FIG. 9, the present invention TR compiler 625 thenperforms mapping optimization. The goal of mapping optimization 785 is agate level implementation that meets the constraints 610 defined in FIG.8 (e.g., timing, porosity, area, power consumption, etc.). It is appliedafter the final implementation of the sequential logic is determined.Specifically, logic block 785 of the present invention compares each ofits circuit implementations against the constraints 610 to gauge theoverall quality of that implementation. Constraints 610 determine whichtransformations are accepted and which are rejected. There are a numberof processes that operate at this step in order to optimize the overalldesign to satisfy the design rule and performance constraints 610. Ingeneral, logic block 785 identifies critical design points throughoutthe entire design and applies optimization techniques to these points inan attempt to satisfy the constraints 610. A critical point is a pointthat does not satisfy determined constraints. A critical load is a loadassociated with a critical path that does not meet determinedconstraints. A noncritical load a load associated with a path thatsatisfy determined constraints.

Logic block 785 is applied in an iterative basis over the entire designin an attempt to satisfy constraints 610. During this iterativeapproach, logic block 785 contains a heuristic that can applyoptimizations that locally increase circuit cost, but over the length ofthe process can reduce circuit cost when applied iteratively over theentire design. Some of the processing of optimization logic block 785applied at critical circuit points includes: sizing drivers and loads;phasing; buffering; downsizing; isolation; offloading; balancing; andsplitting. These are described in more detail with respect to FIG. 12.One embodiment of the present invention advantageously utilizes some ofthe optimization features of logic block 785 in the modified scaninsertion process 645 which is described further below.

At logic block 790, the present invention TR compiler 625 performs logicverification to determine if the design generated by logic block 785 andthe HDL design input at logic block 710 are functionally equivalent.Timing considerations are ignored for logic verification. In logic block790, Boolean matching is performed between the two designs at specifiedpoints within the logic design. Logic block 790 can also be used tocheck manual implementations or changes against the HDL specification.The processing of the TR compiler 625 then exits.

The output generated by the TR compiler 625 of the present invention(see FIG. 8) is a non-scannable netlist description 630 of the design.This netlist description 630, which can be stored in a computer memoryor storage unit, contains scannable memory cells (scannable sequentialcells) with loopback connections 440 associated with each scannablememory cell.

It is appreciated that the netlist 630 is non-scannable because theloopback connections 440 do not form proper scan chains. However, sinceeach sequential cell has been translated at block 750, and scan replacedat block 752, although the netlist is non-scannable it is neverthelessfully scan replaced.

MODIFIED SCAN INSERTION INCLUDING CONSTRAINT DRIVEN COMPILE

An embodiment of the present invention includes a modified scaninsertion logic block 645 that performs a modified version of compile(referred to as "constraint driven" compile) that is applied to thetranslated and replaced sequential cells added by the TR compiler 625 ofthe present invention. By selectively applying portions of compileroutines in a particular fashion during the scan insertion process, thisembodiment of the present invention avoids the time consumingincremental compile process required of the prior art. Instead, themodified scan insertion process 645 provides a tiered effort "constraintdriven" compile routine that provides effective results withsignificantly reduced process time.

Further, by application of specific optimization routines during thescan insertion phase, the present invention offers increased flexibilityat this phase because logic changes can be more readily made during scaninsertion that alter the gate level connections. These optimizations arediscussed below.

Specifically, the modified scan insertion block 645 of FIG. 8 isdescribed in more detail with reference to FIG. 10. The logic blocks ofFIG. 10 are implemented within computer system 112. Unlike the presentinvention TR compiler 625, prior art compilers do not provide scannablesequential cell translation. Therefore, the modified scan insertionlogic block 645 of the present invention provides logic for processing(1) netlists that have scannable sequential cells and (2) those netliststhat do not have scannable sequential cells so that block 645 iscompatible with both compiler types.

With reference to FIG. 10, logic block 645 begins at logic block 805which receives a netlist output by the DRC 635 (FIG. 8). This netlistcan contain sequential cells that are marked as violated. Block 805unscans those sequential cells that are marked as violated or marked asotherwise "do not scan." These cells are unscannable. This isessentially a reverse of the replace sequential cells block 752 and atblock 805 the scannable sequential cell is replaced by a timing andfunctionally equivalent unscannable sequential cell.

It is appreciated that block 645 can also receive input from a prior artcompiler which does not perform scannable sequential translation. Insuch case, the input netlist can contain non-scan memory cells (e.g.,they are not scannable sequent al cells). At block 810, the presentinvention identifies those non-violated sequential cells that are notscan replaced. The present invention modified scan insertion block 645then scan replaces these identified sequential cells with equivalenttechnology dependent scannable memory cells.

The process of sequential cell replacement applied at block 810 of FIG.10 is analogous to the sequential replacement block 752 of FIG. 9 thatcan utilize a function identification string for comparison. If theabove string identification approach fails, the present inventionapplies sequential mapping. Sequential mapping is performed at block 810based on design rules and area constraints while performance constraintsare not considered.

At logic block 815 of FIG. 10, the present invention modified scaninsertion process 645 determines which cell output to select asappropriate to couple a particular scannable sequential cell to anotherscannable sequential cell in a valid scan chain. Assuming an exemplary Dflip-flop model, block 815 selects the Q output, or the /Q output, anequal cell output, or opposite cell output. This selection is based onrules that attempt to best optimize to the constraints 610. With equalcells outputs, the output having the most slack (e.g., most favorabletiming characteristics) is selected to be the scan-out driver to betteroptimize the design to timing constraints. In general, at block 815, thepresent invention attempts to select the output that will least likelyimpact the mission mode circuitry of the overall design. In other words,if one output (e.g., Q) is coupled to critical mission modecombinational logic, the present invention will attempt to connect thissequential cell to the scan chain using the other output (e.g., /Q).

At logic block 820, the modified scan insertion block beaks the loopbackconnections 440 inserted by the TR compiler 625 of the presentinvention. The loopback connections 440 or "routing" lines are brokenbecause block 645 determines and routes the proper connections to formvalid scan chains among the scannable sequential cells. The loopbackconnections 440 were temporarily installed during the HDL compile 625steps to simulate the electrical characteristics of the actual scanchains for constraint optimization. At this time, they are removed. Itis appreciated that blocks 815 and 820 can operate simultaneously.

At logic block 825 of FIG. 10, the present invention performs allocationof resources and routing to construct the scan chains between scannablesequential cells by linking non-violated scannable sequential cells.Also added during block 325 are other circuit resources required toconstruct the scan chains, such as signals lines for clock signals andscan enable signals, etc. The processing of block 825 required todetermine the scan chains, allocate the proper resources to constructthe chains, and route the input and output lines and gates accordinglyis well known in the art. Any of a number of well known procedures cantherefore be used consistent with the scope of the present invention atthis stage to construct the scan chains. At block 825, the presentinvention will construct the scan chains differently, as is known in theart, depending if a full or partial scan is required.

Logic block 645 also determines if the added DFT implementations causeany potential circuit failures as a result of the application of testvectors. After scan insertion, additional logic may be needed to protectagainst potential problems that are encountered as a result of shiftingtest vector bits into memory cells that normally would never have theseinput values. For example, the application of the test vectors can causebus shorts. In these cases, disabling logic elements are added by block645 to prevent the potential short circuit.

At logic block 830 of FIG. 10, the present invention determines if aparticular scan connection (e.g., between two consecutive scannablesequential cells) spans between modules. If so, then the presentinvention places an isolation gate between the output port of theupstream scannable cell and the downstream cell, this is shown in FIG.7. This is referred to as hierarchical isolation. By providing the aboveisolation logic, the addition of the loopback connections 440 by the TRcompiler 625 still accurately represents the electrical characteristicsof a scan chain connection that spans across two modules.

At the completion of block 830, a number of components are added to theoriginal circuit design that was input at block 810. These additionswere not optimized to satisfy performance or design rule relatedconstraints. Therefore, logic block 835 is provided by the presentinvention to provide a form of mapping optimization so that te overalldesign with the DFT circuitry can better meet performance and designrule constraints. These optimizations performed at block 835 are called"constraint driven" compile optimizations and are a proper subset of theoptimizations performed by the TR compiler 625 of the present invention.However, as described below, block 835 provides certain optimizationsapplicable only to the DFT circuitry (e.g., the scannable sequentialcells and the rotating for the scan chains) in a tiered effort approach.In other instances, block 835 provides optimizations along criticalpoints of the entire netlist design. The combination of the abovetechniques provides an effective optimization scheme for meeting designrule constraints and attempting to match performance constraints whilenot consuming a large amount of processing time.

Constraint driven compilation block 835 will be described in more detailwith reference to FIG. 11A which shows processes to reach performanceconstraints and FIG. 11B which illustrates processes to reach designrule constraints (e.g., maximum fanout, maximum capacitance, maximumsignal transition). Assuming the constraints are met by application ofblock 835, the output of block 835 is a scannable netlist of die circuitdesign.

With reference to FIG. 11A and FIG. 11B, the constraint driven compileblock 835 of the present invention is described. Logic block 835receives two selection controls as input (1) an effort selection (map₋₋effort) that can be low, medium or high and (2) an ignore design rulesselection (ignore₋₋ compile₋₋ design₋₋ rules) that can be true or false.Processing starts at block 850 where the present invention determines ifa performance constraint (e.g., timing, etc.) exists and is violated. Ifthere are no performance constraints or none is violated, then block 835flows to node "B." If a performance constraint violation appears,processing flows to logic block 852.

At logic block 852, the present invention performs a number ofoptimizations ("size design") along critical points only of the portionof the netlist that was introduced as a result of DFT processes. Withinprocess 852, the entire design is not modified, only the portion ordomain that comprises the DFT implementations, e.g., those portions thatwere added by insertion of the scannable sequential cells, the scanchains, anti any other logic added as a result of the DFTimplementations. The present invention is able to advantageously performthis type of size design process 852 because the present invention isknowledgeable about the DFT introduced circuits and can limit thisapplication of the size design to these circuit portions. Theseoptimizations are applied in accordance with design dependent heuristicsthat are described below. The optimizations performed on the criticalpoints of the DFT introduced circuitry in block 852 include sizing,phasing, buffering, downsizing, isolation, off-loading, balancing, andsplitting. These optimizations and their application heuristics aredescribed in more detail in FIG. 12.

At block 852 of FIG. 11A, the cells that were added as a result of DFTare examined and a number of critical points and paths within the designare determined. These points generally contain negative slack values(e.g., they do not meet the timing constraints) and are ranked in a set.Therefore, according to one optimization employed at block 852, anydrivers that belong to the DFT implementation at these points areincreased in drive strength to increase the amount of slack at thesepoints. By increasing the drive strength of these drivers at thesepoints, the node has a better chance of meeting timing constraints.After a size increase in a particular driver is implemented, a newranked set of critical points is generated. Other optimizations are thenprocessed and the process is then repeated. It is appreciated that block852 operates only on the DFT implementations.

At logic block 854 of FIG. 11A, the present invention checks if themap₋₋ effort input indicated low effort. If so, then processingcontinues to node "B." If at the completion of block 852, theperformance constraints are not violated and the map₋₋ effort is mediumor high, then processing also continues to node "B." If the map₋₋ effortis medium or high and performance violations still exist, thenprocessing flows to logic block 856. At logic block 856, the presentinvention performs the size design optimizations along critical pointsof the entire netlist and there is no domain restriction to the DFTimplementations as with block 852. These optimizations are applied inaccordance with design dependent heuristics that are described withreference to FIG. 12. The optimizations performed on the critical pointsof the circuit in block 856 include sizing, phasing, buffering,downsizing, isolation, off-loading, balancing, and splitting. They aredescribed in more detail in FIG. 12. A substantial portion of theprocessing performed by constraint driven compile 835 is performed atblock 856.

At logic block 858 of FIG. 11A, the present invention checks if themap₋₋ effort input indicated medium effort. If so, then processingcontinues to node "B." If at the completion of block 856, theperformance constraints are not violated and the map₋₋ effort is high,then processing also continues to node "B." If the map₋₋ effort is highand performance violations still exist, then processing flows to logicblock 860. At logic block 860, sequential mapping is performed alongcritical points only for non-scan cells. At the identified criticalpoints, surrounding combinational logic is combined with an adjacentnon-scan sequential cell and both are replaced with a complex non-scansequential cell that includes the functions of the surroundingcombinational logic. The complex non-scan sequential cell is retrievedfrom the target technology library 743 as shown. The opposite can alsooccur wherein a complex non-scan sequential cell can be replaced with asimpler non-scan sequential cell and combinational logic. This processis similar to block 750 of FIG. 9. At block 860, functional equivalenceis maintained during the mapping.

At logic block 862 of FIG. 11A, the present invention then reduces thesize of circuits along points including non-critical points byperforming local optimizations. Although this can decrease the drivestrength of some circuits, it does not introduce performance constraintviolations and has the advantage of bringing the design closer to therequired area constraints. Once block 862 is complete, the presentinvention flows to logic block 864 to perform a particular localoptimization where identified inverter pairs are replaced with singlebuffers reducing the number of extra or redundant inverters in thedesign. At the completion of block 864, the present invention thenperforms another size design optimization at critical points across theentire netlist. Logic block 866 is therefore analogous to block 856.Subsequently, processing flows to node "B" as indicated.

FIG. 11B illustrates the remainder of the processing performed by logicblock 835 of the present invention. The flow enters through node "B" andat block 868 the present invention tests if the ignore₋₋ compile₋₋design₋₋ rules selection is true or false. If true, then processingflows to block 876 and logic blocks 870, 872, and 874 are skipped. Logicblocks 870, 872, and 874 represent optimizations the present inventioncan perform to ensure that the netlist satisfies the design ruleconstraints (e.g., fanout, signal transition, and node capacitance).Generally, by the addition of an appropriate number of elements, thepresent invention can fix the design rule violations. If ignore₋₋compile₋₋ design₋₋ rules selection is false, then processing flows tologic block 870 where the fanout number of each point is checked andviolations of the fanout limitation in the design rules constraints arefixed. Violations are fixed at block 870 by adding an appropriate numberof buffers so that each point along the design satisfies the fanoutlimitations.

At logic block 872 of FIG. 11B, the present invention checks the pointsin the design to determine if any signal transitions are too slow andviolate signal transition constraints. The signal transitions can be tooslow if the nodes are overloaded. To increase signal transition time,the present invention adds buffer elements to reduce the loading effectsand/or increases the size the drivers at these nodes. At logic block874, the present invention checks the points in the design to determineif any node has node capacitance that exceed the node capacitanceconstraints. If so, the present invention adds buffer elements todecrease the loading and therefore decrease the capacitance at anyparticular node having excessive node capacitance.

At logic block 876 of FIG. 11B, the present invention checks if map₋₋effort is low and if so, processing exits from block 835. If map₋₋effort is medium or high, then processing continues to logic block 878where the performance constraints are examined. The addition of elementsby blocks 870, 872 and 874 can cause certain performance constraints tobe violated. If constraint violations are not present, then processingexits logic block 835. At this point, performance and design ruleconstraints have been met and the netlist is fully scannable.

If constraint violations are present at block 878 and this is the firstpass through logic block 835, then processing flows from block 885 tonode "A" which enters FIG. 11A at node 856. If constraint violations arepresent at block 878 and this is the second pass through block 835, thenprocessing flows from logic block 885 and exits block 835. In the lattercase, the constraint driven compile process 835 was unable to satisfythe performance constraints 610 of the design considering the DFTimplementations.

By providing the processing flow shown in FIG. 11A and FIG. 11B, thepresent invention offers a three tiered effort driven constraint drivencompile process. At low map₋₋ effort, the least amount of CPU 101processing time is consumed to size design only the circuitry added tothe DFT implementations and design rules can be checked. At medium map₋₋effort, the above is done and the size design optimization is appliedacross critical points of the entire design. At high map₋₋ effort, CPU101 processing time is not a critical factor and the above is done aswell as sequential mapping, size down, inverter reduction and anothersize design is performed. In addition, another size design optimizationis performed. For medium and high map efforts, the above is performedthrough more than one pass if performance constraints are still notsatisfied.

FIG. 12 illustrates in more detail the processing performed by the sizedesign optimization block 856 (and block 866) of the present invention.It is appreciated that FIG. 12 represents a set of processes performedalong critical points of the input circuit design domain in order tomeet performance constraints. The size design optimization block 856 (ofthe constraint driven compile 835 block) is based on design dependentheuristics which determine processes are applied to the critical pointsof the input domain. The design domain can consist of only the DFTimplemented circuitry (as for block 852) or can consist of the entiredesign (as for block 856).

Flow 856 of FIG. 12 enters at switch logic block 920 where designdependent heuristics determine which process to select from processes902-916. At the completion of an optimization block (of 902-916),processing exists and can be repeated if critical points in the designremain. The blocks selected for subsequent processing depends on thedesign and history dependent heuristics and again is selected by switchlogic 920. At block 902, the present invention increases the size andtherefore the drive capacity of a driver or increases the receivingcapability of a load. This is done to increase the response of a systemto satisfy performance (e.g., timing) constraints. Block 902 isperformed across critical points in the entire design. As discussedabove, a critical point along the input netlist design consists of adesign node wherein timing or other performance constraints are not met.

In logic block 904, the present invention moves a drive line from oneoutput of a sequential cell to its inverse output. This is calledphasing. FIG. 13A illustrates a sequential cell 1050 driving a load 1055with the Q output. In phasing, the load 1055 is moved to the inverseoutput (/Q in this example) as shown in FIG. 13B. An inverter 1057 isrequired in this case. Block 904 is performed across critical points inthe input design domain (e.g., the entire design or only the added DFTimplementations). Phasing is particularly useful in NMOS systems.

In logic block 906 of FIG. 12, buffering is performed where a number ofdifferent buffer configurations can be transformed. As shown in FIG.14A, a single buffer 1061 can be transformed to a pair of inverters 1063and 1065 as shown in FIG. 14B and vice-versa A pair of buffers can alsobe replaced with a single buffer or vice-versa. At block 906, aninverter can be transformed into a buffered inverter or vice-versa. Or,lines coupled to a node in a branch network can be individuallybuffered. These options are applied across critical points in the inputdesign domain to optimize to performance constraints. Options can beapplied using a trial and error basis.

In logic block 908, the present invention performs downsizing wherecertain loads are determined that can be moved from one location to alogically equivalent location. After the load is removed, the associateddriver is downsized to meet area constraints. FIG. 15A illustrates asequential cell 1050 with an output, Q, driving a load 1055 and otherlines 1071, 1072 driving other loads (not shown). Also shown is an othercell 1051. If is determined that Q of cell 1050 and Q of cell 1051 arelogically equivalent, then the load 1055 can be moved from cell 1050 tocell 1051 as shown in FIG. 15B. In this example Q of cell 1050 is thecritical point. The driver of cell 1050 is then downsized resulting incell 1050'. By downsizing the driver, certain constraints can be morereadily satisfied. Block 908 of the present invention is performed atcritical points in the input design domain.

In logic block 910 of FIG. 12, the present invention performs loadisolation. With reference to a critical point, all of the noncriticalloads (NCLs) are buffered leaving the critical load (CL) directlycoupled. In this way the NCLs are isolated from the critical point andpath. FIG. 16A illustrates a critical point (Q) of cell 1050 coupled totwo NCLs 1056, 1058 and one CL 1055. Isolation acts to buffer NCLs 1056and 1058 with buffers 1073 and 1074 as shown in FIG. 16B. By isolatingthe critical point Q of cell 1055 from the NCLs, certain performanceconstraints can be more readily satisfied. Block 910 of the presentinvention is performed at critical points in the input design domain.

At logic block 912, the present invention performs offloading. In thisprocess, the present invention determines NCLs of a particular criticalpoint and offloads the NCLs to other equivalent drivers. The equivalentdriver can have greater drive strength. Between many drivers, block 912selects the one with the most slack. FIG. 17A illustrates two NCLs 1056,1058 coupled to Q of cell 1050. Also a CL 1055 is coupled to Q. Inoffloading, the present invention offloads NCL 1058 to another driver.In this example, the equivalent driver is the IQ output of cell 1050 asshown in FIG. 17B. An inverter 1057 is added to maintain logicalequivalence. Equally possible, both NCLs 1056, 1058 can be offloaded.Block 912 of the present invention is performed at critical points inthe input design domain.

In logic block 914 of FIG. 12, the present invention performs balancingwith respect to critical points of cells and within driver networks.With respect to a sequential cell, the present invention attempts toevenly balance loads across the Q and /Q outputs. This can requiremoving one load to another sequential output. With respect to a drivernetwork, FIG. 15A illustrates a network having a common driver 1081 andsecondary drivers 1083 and 1085. The loads 1085a-f are not balanced withmore loads on driver 1085. The present invention moves a load fromdriver 1085 to driver 1083 to balance the network. The result is shownin FIG. 18B where the network is better balanced. By performing loadbalancing, certain performance constraints can be more readilysatisfied. Block 914 of the present invention is performed at criticalpoints in the input design domain.

In logic block 916, the present invention performs load splitting. Inthis process, drivers having critical loads are duplicated with thecritical load applied to its own driver or a duplicate driver havingincreased drive strength. FIG. 19A illustrates a single driver 1081 witha critical load 1055 and other lines 1087 coupling with other NCLs (notshown). The present invention, as shown in FIG. 19B, duplicates driver1081 and provides driver 1081' which is coupled to the critical load1055. The remainder of the NCLs are coupled via lines 1087 to driver1081. By splitting, certain performance constraints can be more readilysatisfied. Block 916 of the present invention is performed at criticalpoints in the input design domain.

By applying the above eight timing related optimizations to the inputdesign domain, the present invention constraint driven compile provideseffective measures to meet the performance constraints while requiringsubstantially reduced CPU 101 processing time over the prior artincremental compile step. Although a number of procedures can be used,an exemplary procedure is shown below to implement the processing ofFIG. 12:

while (critical point←get₋₋ critical₋₋ point(design))

do

num₋₋ trials=1;

do

trial₋₋ process←get₋₋ trial₋₋ process(design);

if (accept₋₋ trial←accept₋₋ trial(desin, trial₋₋ process))

then

implement₋₋ process(design, trial₋₋ process)

report₋₋ accept₋₋ process(design, trial₋₋ process, num₋₋ trials)

else

num₋₋ trials←num₋₋ trials+1

fi

while (!accept₋₋ trial)

done

wherein:

get₋₋ critical₋₋ point--processes different critical points

get₋₋ trial₋₋ process--selects the next process (e.g., from blocks902-916) and is a design-dependent heuristic.

accept₋₋ trial--decides to accept the trial and is anotherdesign-dependent heuristic

PROCESSING HIERARCHICAL DESIGNS

FIG. 20 illustrates that the present invention TR compiler 625 and thepresent invention modified scan insertion block 645 can beadvantageously used to perform synthesis in a hierarchical fashioninvolving modules and submodules of an integrated circuit design. Thepresent invention is extremely versatile in handling hierarchicalprocessing because the modified scan insertion block 645 (1) acceptsfully scanned, partially scanned or nonscanned netlist as input and also(2) operates efficiently in terms of processing time therefore chiplevel processing is possible without substantial delay.

Since the modified scan insertion block 645 accepts fully and partlyscanned sequential cells, a number of different processes can outputnetlists to block 645. For those netlists that are fully scan replaced,block 645 performs no sequential translation or replacement but appliesother procedures as described in FIG. 10 (such as loopback connectionremoval, scan chain routing, and optimization). For those netlists thatare unscanned or partially scanned, block 645 performs the above tasksand also performs sequential replacement.

As shown in FIG. 20, a chip design can be represented by a number ofmodules 1001, 1003, 1005, and 1006. Alternatively, a single module canbe represented by submodules 1001, 1003, 1005, and 1006. FIG. 20illustrates an exemplary circuit synthesis using hierarchicalapproaches. Module A, in this example, is executed through the TRcompiler 625 of the present invention that produces a fully scanreplaced but unscannable netlist 1015. This netlist 1015 containsscannable sequential cells having loopback connections. A DFT DRCprocess can be run on netlist 1015 to mark violated cells. For reasonsleft up to designers, module 1001 is not processed on the module levelby the modified scan insertion logic so the result is sent to block1025.

Module 1003, like module 1001 is executed through the TR compiler 625 ofthe present invention which produces a fully scan replaced 1017 netlistthat is unscannable due to the loopback connections 440. A DFT DRCprocess can be run at this point to mark violated cells. The result isthen processed by the modified scan insertion block 645 of the presentinvention. This process 645 will unscan violated cells. The result ofblock 645 is a scannable netlist 1021 meeting the defined constraints.This is forwarded to block 1025.

Module 1005 of FIG. 20 is processed through a compiler 225 that does notprovide any scan replacement or DFT optimization. The result is anonscanned netlist 1019 that does not contain scannable sequential cellsor scan chains. This result 1019 is then fed through the modified scaninsertion block 645 of the present invention which will providesequential translation and construction of scan chains as well asoptimization as shown in FIG. 10. The result 1023 is a scannable netlisthaving scan chains and scannable memory cells. This netlist 1023 can berun through a DFT DRC to mark violated cells. This result is thenforwarded to block 1025.

Lastly, module 1006 consists of a compiled design through a compilerwithout sequential replacement. Therefore, module 1006 containsunscanned cells that are technology dependent. This result is directlyforwarded to block 1025.

At the chip level, logic block 1025 builds a composition or chip levelnetlist 1027 combining the data received from all of the modules 1001,1003, 1005, and 1006. The composition of modules step within block 1025can be performed implicitly as a result of 1023 can be run through a DFTDRC to mark violated cells. The result is then forwarded to block 1025.

Lastly, module 1006 consists of a compiled design through a compilerwithout sequential replacement. Therefore, module 1006 containsunscanned cells that are technology dependent. This result is directlyforwarded to block 1025.

At the chip level, logic block 1025 builds a composition or chip levelnetlist 1027 combining the data received from all of the modules 1001,1003, 1005, and 1006. The composition of modules step within block 1025can be performed implicitly as a result of processing each modulethrough their module level processing. Also at block 1025, certainbinding or linking functions are performed on the design. Bindingconnects the ports of modules together as needed and also connects portsas needed at the chip level design.

Composition block 1025 also accepts as input a nonscannable netlist withscannable cells but no loop back connections 440. The netlist 1027contains some fully scan replaced portions and some completely unscannedportions. The chip level netlist 1027 is then input to the modified scaninsertion block 645 which produces a chip level scannable netlist 1029.Block 645 unscans violated cells. Block 645 of the present inventionpreserves the scan chains constructed within the design associated withnetlists 1021 and 1023 while deriving original scan chains for thedesign associated with netlist 1015 and module 1006. These scan chainsare then linked, as necessary, to provide chip level DFTimplementations. It is appreciated that block 645 can also accept a chiplevel netlist including a portion comprising a nonscannable netlist withscannable cells but no loop back connections 440.

It is appreciated that the present invention modified scan insertionlogic block 645 is advantageously suited for chip level scan insertionbecause it operates with substantially reduced processing time so thatan entire chip level netlist can practically be processed. In otherwords, a time consuming full or incremental compile does not need to beperformed on the chip level netlist 1027. Further, since block 645accepts netlists with scanned and nonscanned sequential cells, it issuited for receiving input from a variety of different compiler options.

It is appreciated that designs 1001, 1003, 1005 and 1006 can also besubdesigns of a single module. In this case, block 1025 will process atthe module level while the above tasks process at the submodule level.

PARTIAL UNSCAN AND NEAR FULL SCAN

FIG. 21A and FIG. 21B are top level flow diagrams of processes using thesubtractive partial unscan procedure and the additive near full scanprocedure of the present invention. These procedures are used todetermine a set of sequential cells that can be scan replaced within aninput design while still maintaining certain specified optimizationconstraints (e.g., area and/or timing) of an input circuit design. Thesubtractive partial unscan procedure 1150 (FIG. 21A) can be used forcircuit designs that do not meet the given optimization constraints evenafter the constraint driven scan insertion process 645 (FIG. 8) isexecuted or after iterations of the prior art secondary compile process217 (FIG. 1) are executed. The additive near full scan procedure 1270(FIG. 21B) can be used with an unscanned netlist of a prior art compileror of an imported netlist. The additive near full scan procedure 1270can be used as a time effective scan replacement procedure for designsthat are expected to have challenges meeting the given optimizationconstraints when DFT elements are added. An implementation also uses theadditive near full scan and the partial unscan in combination, asdescribed in more detail in FIG. 24.

With reference to FIG. 21 A, high level process 1200 is an exemplaryusage of the subtractive partial unscan process 1150 of the presentinvention. The steps and substeps of process 1200 are implemented asprogram code stored within a computer readable memory unit (e.g., unit102, or unit 103, or unit 104 of FIG. 2) of a computer system. Process1150 accesses, receives as input a netlist 630 that is fully scan replaced. In one example, this fully scan replaced netlist 630 originatesfrom the test ready (TR) compiler 625 of the present invention, but canoriginate elsewhere. The TR compiler 625 of the present inventiongenerates a netlist 630 that is non-scannable due to the loopbackconnections 440 associated with the sequential cells. However, thenon-scannable netlist 630 is nevertheless fully scan replaced since allsequential cells are scan replaced. It is appreciated that procedure1150 can accept a fully scan replaced netlist originating from a varietyof different procedures and sources (e.g., refer to FIG. 24) and thatthe non-scannable netlist 630 originating from the TR compiler 625 ofthe present invention is but one example. For application of procedure1150, it is assumed that the input fully scan replaced netlist 630 doesnot meet the given optimization constraints (e.g., area and/or timing)of the design. A ranked list, called a testability cell list (TCL) 1157,to be described further below, is accessed by process 1150 during itssubtractive procedure.

As discussed in more detail with respect to FIG. 22A, the subtractivepartial unscan process 1150 of the present invention selectively unscanscertain scan replaced sequential cells of the input netlist 630 untilthe overall design meets the given optimization constraints (e.g.,timing and area elements of constraints 610 of FIG. 8). The result is apartially scanned netlist 1220. This netlist 1220 is partially scannedbecause certain scan replaced cells from netlist 630 become and remainunscanned after process 1150. In the example of FIG. 21A, the resultantpartially scanned netlist 1220 is called a "non-scannable netlist"because of the loopback connections 440 of the scan replaced cells.

With reference to FIG. 21B, high level process 1250 is an exemplaryusage of the additive near full scan process 1270 of the presentinvention. The steps and substeps of process 1250 are implemented asprogram code stored within a computer readable memory unit (e.g., unit102, or unit 103, or unit 104 of FIG. 2) of a computer system. Process1270 accesses, receives an unscanned netlist 1260 as input. Theunscanned netlist 1260 typically originates from a prior art compiler(e.g., block 225 of FIG. 1) and can be an imported netlist (including amanually created netlist). The additive near full scan process 1270 ofthe present invention selectively modifies the original input netlist1260 by selectively scanning certain unscanned sequential cells. Thisprocess is iterative until the modified design does not meet itsoriginal area and/or timing constraints. The sequential cells selectedfor scan replacement are based on those cells that contribute most totestability. At the completion of process 1250, a nonscannable netlist1280 is generated. The netlist 1280 is non-scannable because, likenetlist 1220 of FIG. 21A, the scan replaced cells have not been linkedtogether in scan chains. The ranked list or testability cell list (TCL)1157, to be described further below, is also accessed by process 1270.

It is appreciated that netlists 1220 and 1280 of FIG. 21A and FIG. 21Bcan be input to the constraint driven scan insertion procedure 645 (FIG.10) of the present invention to generate scannable netlists. Process 645breaks the loopback connections 440 while constructing the scan chainsbetween scan replaced sequential cells. Those cells remaining unscannedwithin netlist 1220 and 1280 can be appropriately marked as violated orvalid nonscan before the netlist is input to process 645 and are notscan replaced by process 645 nor included in the resulting scan chains.Cells marked violated indicate that the cells violates design ruleconstraints while valid non-scanned indicates that the cell does notviolate design rule constraints but is not being scanned to meetoptimization constraints.

With reference to FIG. 22A, the subtractive partial scan process 1150 isillustrated. Process 1150 is typically performed by embodiments of thepresent invention in those cases where the optimization procedures ofthe modified scan insertion block 645, or any other optimization, areunable to satisfy area, timing constraints and (Other performancerelated constraints. In such case, the user selects process 1150 whichdetermines certain scan replaced sequential cells to unscan usingdifferent approaches to satisfy area and/or timing constraints. Beforeentry of process 1150, the user is allowed to assert a timing criticalflag which indicates whether or not timing is to be optimized along witharea optimization. Process 1150 continues to unscan certain scanreplaced sequential cells until the constraints are satisfied or untilanother user selected termination point is reached. At the completion cfprocess 1150, the maximum amount of scannable cells are left in thecircuitry while satisfying area and performance constraints. Process1150 executes within the CAD computer system 112 (FIG. 2) and the stepsof process 1150 are stored as program code in computer readable memoryunits of system 112.

Process 1150 starts at logic block 1153 where a timing critical flag(e.g., stored in the input netlist 630) is accessed and checked. Thetiming critical flag indicates whether or not a user has selected thattiming constraints should be checked during the subtractive process1150. At block 1153, if the timing critical flag is not set, thenprocessing flows to logic block 1170 where area constraints areexamined, otherwise, processing flows to logic block 1155. At block1155, the present invention examines the input fully scan replacednetlist (e.g., netlist 630) and determines a set of critical paths (or asingle critical path) within the input netlist 630 that include scanreplaced cells coupled therewith. Critical paths represent criticaltiming paths of the network because they do not satisfy performance(e.g., timing) constraints.

A critical path is composed of two end points and typically each endpoint consists of a sequential cell (e.g. a first and a secondsequential cell). FIG. 22B illustrates an exemplary critical path fromsequential cell 1198 through combinational logic 1193 and to asequential cell 1196. A critical path can have both of its sequentialcells (e.g., 1198 and 1196) scan replaced, or only one of its sequentialcells scan replaced, or neither of its sequential cells scan replaced. Acritical path having at least one of its sequential cells scan replacedis called a critical path with a scan replaced (e.g., scanned) cell.

At logic block 1155 of FIG. 22A, a set of critical paths are determinedby performing a timing analysis (e.g., a static timing analysis) on theinput netlist design. The static timing analysis can be unit timingbased or cycle based and many well known methods of performing statictiming analysis can be performed in accordance with the presentinvention at logic block 1155. Those paths not meeting timingconstraints as measured through the static timing analysis procedure aremarked as critical paths of the input design. The critical paths can beranked by their criticality. Those critical paths that have the worsetiming performance and said to be the worst critical of the criticalpaths.

At logic block 1160 of FIG. 22A four conditions are checked in anyorder. First, if there are no critical paths with a scan replaced cellto be determined, but critical paths remain in the input design, thenprocessing flows from block 1160 to block 1170 and a message isgenerated indicating that timing constraints are not satisfied and thereis no more unscanning that can be accomplished within the design.Second, if there remain critical paths with scan replaced cells in thedesign, but there are worse critical paths also remaining in the designthat do not have any scan replaced cells, processing flows from block1160 to block 1170 and a message is generated indicating that timingconstraints are not satisfied and will not improve by unscanning furthercritical paths since worse paths exist that have no scan replaced cells.Third, at block 1160, if there are no critical paths left at all, thenprocessing flows from block 1160 to block 1170 and a message isgenerated indicating that timing constraints are satisfied. At block1170, area constraints are checked. Fourth, if a critical path with ascan replaced cell exists or a set of critical paths each with a scanreplaced cell exist, then processing flows from block 1160 to logicblock 1165 and timing optimization continues.

Logic block 1165 interfaces with and uses a testability cell ranked listor "TCL ranked list" 1157, residing in computer readable memory unit 102or 104 (FIG. 2). The TCL ranked list 1157 originates from a procedurethat ranks each memory cell in the netlist by order of its importancewithin the mission mode circuitry. This ranking can also be viewed as aranking of individual cells and their contribution to testability. Anumber of well known methods and procedures can be used to generate theTCL ranked list 115, in accordance with the present invention and theresult is a ranked list of sequential cells with their associated TCLrank number. The following references describe some of the well knownways in which the TCL ranked list 1157 can be generated: E. Trischler,"Incomplete Scan Path with an Automatic Test Generation Methodology," inProc. Int. Test Conf., pp. 153-162, November 1980; K. S. Kim and C. R.Kime, "Partial Scan by User of Empirical Testability" in Proc. Int.Conf. Computer-Aided Design, pp. 314-317, November 1990; M. Abramovici,J. J. Kulikowski, and R. K. Roy, "The Best Flip-flops to Scan," in Proc.Int. Test Conf., pp. 166-173, October 1991; and D. H. Lee and S. M.Reddy, "On Determining Scan Flip-flops in Partial Scan Designs," inProc. Int. Conf. on Computer-Aided Design, pp. 322-325, November 1990;and several other references (e.g., [1], [5-13], and [15-18]) cited byI. Park, D. S. Ha, and G. Sim, in "A New Method for Partial Scan DesignBased on Propagation and Justification Requirements of Faults," Proc.Int. Test Conf., pp. 413-422, 1995, paper 19.2.

Within the TCL ranked list 1157, those sequential cells that do notcontrol a relatively large amount of combinational logic and/or thosecells that do not observe a relatively large amount of combinationallogic are placed at the bottom of the TCL ranked list 1157 and havelower TCL rank numbers. Those sequential cells that control a relativelylarge amount of combinational logic and/or observe a relatively largeamount of combinational logic are placed at the top of the TCL rankedlist 1157 and have higher TCL rank numbers. It is more advantageous toscan replace cells with higher TCL rank numbers (e.g., lower in thelist) because these cells contribute more to testability due to theamount of circuitry they control and/or observe. Alternatively, it isless advantageous to scan replace cells with lower TCL rank numbers(e.g., higher in the list).

It is appreciated that within the present invention the convention canbe reversed of assigning higher TCL rankings to sequential cells thatcontribute highly to testability and assigning lower TCL rankings to theother sequential cells.

At logic block 1165 of FIG. 22A, the present invention selects aparticular critical path having a scan replaced cell of those criticalpaths identified in block 1160. In an exemplary implementation of thepresent invention, at block 1165, those critical paths with the worseperformance are selected first.

At block 1165, the present invention determines the specific sequentialcells coupled with the particular selected critical path. An exemplarycritical path 1190 is shown in FIG. 22B where both sequential cellscoupled to the particular critical path are scan replaced. The scanreplaced sequential cells coupled in path 1190 are cells 1198 and 1196.These sequential cells are also ranked in TCL ranked list 1157. TheirTCL rank numbers are shown in quotes with 1198 at "6," and 1196 at "2."In this case, for this critical path 1190 having two scan replacedsequential cells, the present invention at logic block 1165 searchesthrough the TCL ranked list 1157 to determine the scan replacedsequential cell with the lowest rank number; this scan replacedsequential cell is determined to have the smallest impact on, orcontribution to, testability due to its low rank number compared to theother cells within the critical path 1190. This selected cell is thenunscanned. Therefore, at logic block 1165 of FIG. 22A, the presentinvention unscans scan replaced sequential cell 1196, as shown by thedotted line, in an attempt to meet timing constraints 610.

At logic block 1165, if the selected particular critical path containsan unscanned and a scan replaced sequential cell on either ends of thepath, the scan replaced sequential cell is unscanned regardless of TCLranking.

Flow then returns to logic block 1155 of FIG. 22A where the input designis examined for critical paths again, if any. If another critical pathwith a scanned replaced cell is determined at logic block 1160 (andthere are no worse critical paths having only unscanned cells), thenlogic block 1165 is again introduced to continue timing optimization.

At logic block 1170, the present invention checks area constraints. Whenthe timing constraints flag is not asserted or timing constraints drivenunscanning completes (e.g., logic blocks 1155, 1160, and 1165), thepresent invention goes on to satisfy area constraints. To satisfy areaconstraints, the present invention at logic block 170 sums the areaconsumed by logic cells and routing resources of the input design andchecks if the area constraints 610 are violated by this summation. Ifarea constraints are violated, then processing flows to logic block1172, else process 1150 exits. At logic block 1172, the presentinvention accesses the TCL ranked list 1157 and starts at the end of theTCL ranked list 1157 having the lowest rank number and selects the scanreplaced cell having the lowest rank number. At block 1172, the selectedscan replaced cell having the lowest rank number is then unscanned.After unscanning an individual cell, the present invention flows tologic block 1174 where the TCL ranked list 1157 is checked to determineif it contains any more scan replaced cells. If not, process 1150returns, else processing flows to block 1170 where the present inventionrecomputes the area summation of the design and checks this newsummation value against the area constraints 610. This subprocesscontinues until the area constraints are satisfied or until the TCLranked list 1157 is empty; in either situation process 1150 returns. Ifthe TCL ranked list 1157 becomes empty and area constraint have not beenmeet, process 1150 returns and indicates that area constraints could notbe satisfied.

At the completion of process 1150, a partially unscanned netlist 1220(FIG. 21 A) results. The resulting netlist 1220 typically meets timingand area constraints. However, in extreme cases, netlist 1220 can becompletely unscanned but there still remains critical paths with no scanreplaced cells; in such case, timing constraints 610 are not met and anerror message is indicated. Also, if area constraints are not satisfied,as discussed above, an error message is indicated. The resultant netlist1220 generated by the processing 1150 of FIG. 22A is then stored in acomputer memory, e.g., RAM 102 or device 104 (FIG. 2).

With reference to FIG. 23A, the additive near full scan procedure 1270of the present invention is shown. Process 1270 is executed on computersystem 112 (FIG. 2) and the steps of process 1270 are stored as programcode in computer readable memory units of system 112. At logic block1310, process 1270 receives, accesses an unscanned netlist (e.g.,netlist 1260 of FIG. 21B). The unscanned netlist received at logic block1310 is called the original netlist design. At block 1310, process 1270also receives timing information indicative of the worse timing path ofthe input design netlist 1260. At logic block 1320, the presentinvention receives, accesses the TCL ranked list 1157, e.g., the sameTCL ranked list 1157 described above, which is a ranked list indicatingsequential cells and their relative contribution to testability. WithinTCL ranked list 1157, those cells with higher contributions totestability have higher ranking numbers and those cells with comparablyless contributions to testability have lower ranking numbers.

Process 1270 flows to logic block 1350 of FIG. 23B. At block 1350, thepresent invention additive process 1270 accesses the unscannedsequential cell of the TCL ranked list 1157 having the highest ranknumber. At logic block 1360, the original netlist design is copied in analternate portion of memory 102 (FIG. 2) as a design copy and theselected unscanned sequential cell within the design copy is then scanreplaced. The scan replacement step of logic block 1360 is analogous tosteps 750 through 755 (FIG. 9) of the TR compiler process 625.

At logic block 1362, the present invention determines whether the areaconstraints 610 are violated by the design copy created in block 1360.At block 1362, the area consumed by each logic element and the routingresources are summed and this summed value is compared against thepredetermined area constraints 610. If area constraints are not violatedby the design copy then processing flows to logic block 1364. If areaconstraints are violated by the design copy then processing flows tologic block 1400.

At logic block 1364, the present invention checks if a timing criticalflag is set within the input design 1260. If the timing critical flag isset, then the present invention insures that each scan replaced celldoes not cause the design copy to violate the predetermined timingconstraints 610. If the timing critical flag is set, processing flows toblock 1370. If the timing critical flag is not set, processing flows tologic block 1390 where the currently selected unscanned cell is scannedin the original design.

At logic block 1370 of FIG. 23B, the present invention then performs atiming analysis (e.g., static timing analysis) on the design copy havingthe most recently scan replaced memory cell. A number of well knownstatic timing analysis procedures can be used at logic block 1370 andcan include a unit timing based or cycle based analysis. From theinformation gained at logic block 1370, the present invention generatesa set of paths within the design copy netlist including critical pathsand less critical paths; the paths are ranked based on their criticality(e.g., their comparable performance). Located at the top of the list setare the worst critical paths, e.g., those circuit paths (e.g., betweensequential cells) that require the longest settling time to process.Located on the bottom of the path set are less critical paths. Thecritical paths indicate those paths that do not meet timing constraintsor that require the longest settling time of the paths of the netlist;these paths are stored in memory 102 or 104 of system 112 (FIG. 2) atlogic block 1370. The overall timing of the design copy is based onthese worst paths. More than one critical path can share the same timingvalue.

Within logic block 1370, after the timing analysis, the presentinvention identifies the worse path within the design copy. At logicblock 1380, the present invention determines if the worse path of thedesign copy does not meet timing constraints, e.g., has worse timingthan the original design received at block 1310. If this is the case,processing flows to logic block 1400 and the selected unscanned cell isnot scan replaced within the original design. If the determination oflogic block 1380 is false, then at logic block 1390 the selectedunscanned cell is scan replaced in the original design and the originaldesign now includes the scan replaced selected cell. It is appreciatedthat the scan replacement step of logic block 1390 is analogous to steps750 through 755 (FIG. 9) of the TR compiler process 625.

After block 1390, logic block 1400 is executed. At logic block 1400, thepresent invention checks if the selected unscanned sequential cell wasthe last unscanned cell within the TCL ranked list 1157. If so,processing of the additive process 1270 exits. If the selectedsequential cell was not the last unscanned sequential cell in the TCLranked list 1157, then at logic block 1410 the next unscanned sequentialcell in rank (order within the TCL ranked list 1157 is selected.Processing then returns to logic block 1361) with the newly selectedcell being scan replaced in the new design copy (the previous designcopy can be overwritten by the new design copy).

This process continues until all unscanned sequential cells (or apredefined subset of cells) within the TCL ranked list 1157 areprocessed. The result is a partially scan replaced netlist thatsatisfies the determined optimization (e.g., area and/or timing)constraints.

It is appreciated that the partially scan replaced netlist that resultsfrom the additive process 1270 can be input to the constraint drivenscan insertion process 645 of the present invention. Those cellsremaining unscanned can be marked as valid non-scanned before input toprocess 645 and will not be scan replaced by process 645 nor included inthe resulting scan chains.

FIG. 24 is a flow diagram illustrating an embodiment of the presentinvention using near full scan 1270 as a front end for the partialunscan process 1150. Process 1510 is useful for imported unscannednetlists (e.g., netlists that originate from a prior art compiler or aremanually generated) and it is desired to transform these importednetlists into a form that imitates the output of the test ready compiler625. At step 1525, process 1510 receives an imported netlist havingunscanned sequential cells. This imported netlist is fed into the nearfull scan process 1270 and the optimization constraints 610 areeliminated during process 1270 so that all unscanned sequential cells ofthe imported netlist are scan replaced regardless of any optimization(e.g., timing and area) constraint violations. Du ring process 1270, aTCL ranked list 1157 is constructed and all unscanned sequential cellsare scan replaced to produce an resultant imported netlist. Theresultant imported netlist is a fully scan replaced netlist and somewhatimitates the output produced by the test ready compiler 625 in that allsequential cells are scan replaced, except no loopback connections arepresent within the resultant imported netlist. An alternative step atthis point (in lieu of using process 1270 without constraints) is toinput the netlist and scan replace all sequential cells and then installloopback connections. The result of this alternative also mimics theoutput produced by the test ready compiler 625.

After process 1270 (or its alternative), the fully scan replacedresultant imported netlist is fed into the normal partial unscan process1150 with optimization constraints (e.g. timing and area) restored tonormal. Partial unscan 1150 then backs off of the scan replacementperformed during step 1270 if timing and area violations exist. At thecompletion of process 1150, a resultant netlist is generated and storedat step 1530. Ideally, at the completion of step 1150, the importednetlist meets given timing and area constraints. As discussed above,timing critical flags can be set with respect to processes 1150 to checkfor or ignore timing.

The preferred embodiment of the present invention, a subtractive partialscan procedure and an additive near full scan procedure are describedfor determining a set of memory cells that can be scan replaced to offersignificant testability while meeting given timing constraints of thedesign. While the present invention has been described in particularembodiments, it should be appreciated that the present invention shouldnot be construed as limited by such embodiments, but rather construedaccording to the below claims.

What is claimed is:
 1. In a computer implemented synthesis system, asubtractive method of generating a netlist containing scan replacedsequential cells and satisfying determined optimization constraints,said method comprising the computer implemented steps of:a) accessing aninput netlist containing scan replaced sequential cells; b) determininga set of critical paths within said input netlist; c) selecting aselected critical path of said set of critical paths and identifying afirst sequential cell and a second sequential cell located on either endof said selected critical path; d) provided said first sequential celland said second sequential cell are both scan replaced, determiningwhich sequential cell of said first and second sequential cellscontributes least to testability; e) within said input netlist,unscanning said sequential cell of said selected critical path thatcontributes least to testability; and f) repeating steps b)-e) whilethere are critical paths in said set of critical paths that have scanreplaced cells and there are no worse critical paths that have no scanreplaced cells.
 2. A method as described in claim 1 wherein said inputnetlist is a fully scan replaced netlist including a loopback connectionassociated with each scan replaced sequential cell.
 3. A method asdescribed in claim 1 wherein said optimization constraints includetiming constraints and wherein said step b) comprises the steps of:b1)performing a timing analysis on said input netlist; b2) identifyingpaths within said input netlist that do not satisfy said timingconstraints based on said timing analysis; and b3) storing said paths assaid set of critical paths.
 4. A method as described in claim 1 furthercomprising the steps of:determining if one of said first and said secondsequential cells is unscanned and if the other sequential cell is scanreplaced; and if said one sequential cell is unscanned and said othersequential cell is scan replaced, unscanning said scan replacedsequential cell regardless of any contribution to testability of saidscan replaced sequential cell.
 5. A method as described in claim 1wherein said optimization constraints include timing and areaconstraints and further comprising the steps of:g) provided areaconstraints are violated by said input netlist, identifying a selectedscan replaced cell having a lowest contribution to testability andunscanning said selected scan replaced cell within said input netlist;and h) repeating step g) until all cells are unscanned or until saidarea constraints are satisfied by said input netlist.
 6. In a computersystem having a processor coupled to a bus and a memory coupled to saidbus, a computer implemented subtractive method of generating a netlisthaving scan replaced sequential cells and satisfying determined timingand area constraints, said method comprising the computer implementedsteps of:a) provided a timing critical selection is asserted, unscanningscan replaced sequential cells that violate said timing constraints,said step a) comprising the steps of:a1) accessing a fully scan replacedinput netlist; a2) performing a timing analysis on said input netlist todetermine a set of critical paths located therein; a3) selecting aparticular critical path of said set of critical paths and identifying afirst sequential cell and a second sequential cell located on either endof said particular critical path; a4) provided said first and secondsequential cells are both scan replaced, determining which contributesleast to testability; a5) within said input netlist, unscanning saidsequential cell of said particular critical path that contributes leastto testability; and a6) repeating steps a2)-a5) while critical pathshaving scan replaced cells exist within said set of critical paths andno worse critical paths exist that have no scan replaced cells; b)provided said area constraints are violated by said input netlist,selecting a particular scan replaced cell having a low contribution totestability and unscanning said particular scan replaced cell; and c)repeating step b) until all cells are unscanned or until said areaconstraints are satisfied.
 7. A method as described in claim 6 whereinsaid step a2) comprises the steps of:performing a static timing analysison said input netlist; identifying paths within said netlist that do notsatisfy said determined timing constraints based on said static timinganalysis; and storing said paths as said set of critical paths.
 8. Amethod as described in claim 6 further including the stepsof:determining if one of said first and second sequential cells isunscanned and if the other sequential cell of said first and secondsequential cells is scan replaced; and if said one sequential cell isunscanned and said other sequential cell is scan replaced, unscanningsaid scan replaced sequential cell regardless of any contribution totestability of said scan replaced sequential cell.
 9. In a computersystem, an additive method of generating a netlist having scan replacedsequential cells and satisfying determined timing constraints, saidmethod comprising the computer implemented steps of:a) accessing anunscanned input netlist including unscanned sequential cells and havingan indication of its worst critical path; b) selecting a particularunscanned sequential cell characterized in that it has highcontributions to testability; c) scanning said particular unscannedsequential cell within said input netlist provided said step of scanningdoes not worsen timing characteristics of said input netlist; d)selecting a next particular unscanned sequential cell; and e) repeatingsteps c)-d) for each unscanned sequential cell.
 10. A method asdescribed in claim 9 wherein said step c) further comprises the stepsof:c1) copying said input netlist to generate an input netlist copy; c2)scanning said particular unscanned sequential cell within said inputnetlist copy; c3) determining the worst critical path of said inputnetlist copy; and c4) scanning said particular unscanned sequential cellwithin said input netlist provided said worst critical path of saidinput netlist copy is not worse than said worse critical path of saidinput netlist.
 11. A method as described in claim 10 wherein said stepof c3) determining the worst critical path of said input netlist copycomprises the step of performing a static timing analysis on said inputnetlist copy.
 12. A method as described in claim 10 wherein said step ofc4) scanning said particular unscanned sequential cell within said inputnetlist provided said worst critical path of said input netlist copy isnot worse than said worse critical path of said input netlist comprisesthe step of replacing said particular unscanned sequential cell with afunctionally equivalent scannable sequential cell.
 13. In a computersystem, an additive method of generating a netlist having scan replacedsequential cells and satisfying determined timing and area constraints,said method comprising the computer implemented steps of:a) accessing anunscanned input netlist including unscanned sequential cells and havingan indication of its worst critical path; b) selecting a particularunscanned sequential cell characterized in that it has highcontributions to testability; c) scanning said particular unscannedsequential cell, provided said step of scanning does not worsen timingcharacteristics or violate timing constraints of said input netlist,wherein said step c) further comprises the steps of:c1) copying saidinput netlist to generate an input netlist copy; c2) scanning saidparticular unscanned sequential cell within said input netlist copy; c4)determining the worst critical path of said copy of said input netlist;c5) summing the areas of each logic and routing element of said inputnetlist copy to determine an area of said input netlist copy; and c6)scanning said particular unscanned sequential cell within said inputnetlist provided said worst critical path of said input netlist copy isnot worse than said worse critical path of said input netlist andprovided said area of said input netlist copy does not violate said areaconstraints; d) selecting a next particular unscanned sequential cell;and e) repeating steps c)-d) for each unscanned sequential cell.
 14. Amethod as described in claim 13 wherein said step c4) comprises the stepof performing a static timing analysis on said input netlist copy.
 15. Amethod as described in claim 13 wherein said step c6) comprises the stepof replacing said particular unscanned sequential cell by a functionallyequivalent scannable sequential cell.
 16. A computer system comprising:aprocessor coupled to a bus; and a computer readable memory unit coupledto said bus, said memory unit having a program stored therein causingsaid computer system to generate a netlist having scan replacedsequential cells and satisfying determined timing and area constraints,said program causing said processor to perform the steps of: a)accessing an unscanned input netlist including unscanned sequentialcells and having an indication of its worst critical path; b) selectinga particular unscanned sequential cell characterized in that it has highcontributions to testability; c) scanning said particular unscannedsequential cell, provided said step of scanning does not worsen timingcharacteristics or violate timing constraints of said input netlist,wherein said step c) further comprises the steps of:c1) copying saidinput netlist to generate an input netlist copy; c2) scanning saidparticular unscanned sequential cell within said input netlist copy; c4)determining the worst critical path of said copy of said input netlist;c5) summing the areas of each logic and routing element of said inputnetlist copy to determine an area of said input netlist copy; and c6)scanning said particular unscanned sequential cell within said inputnetlist provided said worst critical path of said input netlist copy isnot worse than said worse critical path of said input netlist andprovided said area of said input netlist copy does not violate said areaconstraints; d) selecting a next particular unscanned sequential cell;and e) repeating steps c)-d) for each unscanned sequential cell.